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ABSTRACT 


This thesis is concerned with the modelling and performance analysis 
of systems which consist of concurrently acting components, an example of 
which is an asynchronous pipelined processor. The work is divided into two 
parts. 

In the first part, a suitable model is developed for describing the 
structure of asynchronous concurrent systems. In conventional automata 
theory, the finite-state machine model is used to describe the behavior 
of systems; the problem with this is that a large number of states re- 
sults when practical systems are modelled. In this thesis, each system 
component is modelled as a finite-state machine, and a system is viewed 
as an ensemble of interconnected finite-state machines.This has the ad- 
vantage that the size of a system model grows linearly rather than expo- 
nentially with the number of system components. A subclass of Petri nets 
known as SMD (State Machine Decomposable) Petri nets is identified in 
order to formalize the notions of finite-state machines and their inter- 
connection. For convenience, systems of interest are divided into two broad 
categories: 

(a) Deterministic, or decision-free. 

(b) Non-deterministic, or systems with decisions. 

SMD Petri nets are used to model both classes of systems; in addition, a 
subclass of Petri nets known as LSP Petri nets is used to model those 
deterministic systems that cannot be modelled by SMD Petri nets. 

The second part of the thesis is concerned with finding the computation 
rate of activities in real-world asynchronous concurrent systems. Practi- 
cal systems are constructed from devices which have a finite speed of ope- 
ration. Since Petri nets do not have time parameters as part of their 
definition, they can model the structure of systems but cannot be used to 
study their computation rate. The definition of Petri nets is augmented 
to model the speed of operation of a device in a system by assuming that 
the corresponding activity in the Petri net has a finite, non-zero time 
duration. The resulting nets are termed timed Petri nets, and methods are 
given for finding the computation rate of activities in timed SMD and LSP 
Petri nets. The results are applied to the analysis of several asynchro- 
nous systems drawn from areas within and outside the domain of computer 
systems . 
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CHAPTER 1 


INTRODUCTION 


1.1 Background 


The philosophy of this thesis can be described very effectively by the 
following quotation from T.C. Chen [C2]: 


In the quest for performance above and beyond that deliverable 
by hardware componentry, two alternative multiprocessing ap- 
proaches to computer design can be taken. One is to subdivide 
each oncoming job among many identically constructed mechanisms, 
and is commonly referred to as parallelism. The other alternative 
is to develop a collection of specialized mechanisms capable of 
working simultaneously to form a general purpose organization; 


this is commonly called overlap, and an extreme form of this is 
called pipelining. 


It is well known that present day machines are very wasteful in terms of 
resource utilization. One of the original arguments for time-shared 
multiprogrammed systems was the fact that they could permit better utiliza- 
tion of system resources than a batch-processing system by overlapping the 
operation of the processor, primary storage and secondary storage. The 
processor in contemporary computer systems is treated as a resource unit 
which is allocated to a user job or a task within a user job. Each 
processor typically consists of smaller processing units like adders, 
multipliers, an instruction fetch unit and an instruction decode unit, 

all of which can potentially be operated concurrently with each other. 

If it were possible to achieve this degree of concurrency, a much greater 


processing rate could be realized as a consequence. Of course, the over- 


iy 


all throughput of the machine depends not only on the*instruction pro- 
cessing rate of the processor, but ied on factors like the speed of 

the main memory, its degree of interleaving, the type of secondary 

storage and finally on issues like the job mix and the scheduling strategy 
used. But by suitably reconfiguring main and secondary memory, the bene- 
fits of added concurrency in the processor could be realized as added 
throughput in the overall system. In most contemporary processors, over- 
lapped operation of functional units within the processor is restricted 

to concurrent fetching and decoding of instructions. Many systems can be 
run in a multiprocessor configuration so that real parallelism is possible. 
However, this parallelism is between user jobs or between different tasks 
of the same user job, a situation which we term macro-parallelism. Dennis [D3] 
has advocated a computer organization in which parallel operation is pos- 
sible right down to the level of instructions in a computer program or a 
user task, We term this micro-parallelism and it is clearly impractical 
on contemporary machines because of the overhead involved in switching the 
processor between instruction streams. 

Jack Dennis has looked at the issues involved in designing a computer 
system to support micro-parallelism, and readers interested in his pro- 
posals for a memory organization and a representation scheme for programs 
and information structures are referred to his paper [D3]. We quote his 
remarks about the processing hardware: 


The organization of the processing hardware is intended to per- 
mit extensive sharing of multiple specialized cells by many 
computations to ensure statistically high utilization. It is 
envisioned that there be tens to hundreds of units of each cell 
class, operating independently and asynchronously using a ser- 


vice on demand principle of control. 
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The control needed in such a large complex system would be too formidable 
to tackle with a centrally clocked or a synchronous organization. Dennis 
has rightly pointed out the need for asynchronous operation of such a 
system. Each "cell" in Dennis' system corresponds to a functional unit 
of a certain type, like an adder or an instruction decoder, and the opera- 
tion of a cell type will be overlapped or pipelined with the operation of 
other cell types. Thus, by a combination of parallelism and overlap, a 
statistically high instruction throughput should result. We will refer to 
such systems as “asynchronous concurrent systems" and will have occasion 
to use this term often in the course of the thesis. Dennis and Patil [Dl, 
D2] have addressed themselves to the problem of evolving tools for the 
description and implementation of such systems. They have made much 
progress, and their efforts have resulted in a systematic technique for 
both the description of asynchronous concurrent systems together with a 
methodology for their realization using a set of elementary modules [D1, 
D2]. In this thesis, we would like to address ourselves to the question 
of modelling such systems with a view to predicting their performance. 
The performance of an asynchronous processor such as visualized by Dennis 
will depend upon several factors, some of which are listed below: 
a) The number of functional units of each type. 
b) The speed of operation of each functional unit. 
c) Statistical properties of user jobs, e.g., their degree of 
parallelism, relative frequencies of the different instructions. 
d) Gross statistical properties of user jobs, i.e., job arrival 
rates, mean length of user jobs. 
Before we can incorporate these factors into a model for performance 


evaluation of asynchronous computer organizations, we must come up with a 
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suitable descriptive tool or model which can represent the structure of 
such systems in a manner that is both simple and accurate. In particular, 
factors like (a) and (b) should be readily incorporated into the model. 
Models for asynchronous systems as they currently stand can successful- 
ly describe the structure of asynchronous systems, i.e., the structure 
and interconnection of the parts of the system. However, they do not 
have a way of incorporating information about factors like (b) and (c) 
as part of their definition. In this thesis, we have developed a model 
in which factors (a), (b), and (c) can be described in a natural and 
simple way. Thus, given a description of the structure of the system 
and the speed of operation of its parts together with statistics on the 
utilization each part, we can obtain a measure of the throughput of the 
system. If the system we are considering is an asynchronous pipelined 
processor, we can obtain a measure of its processing rate, given that we 
have information of types (a), (b) and (c) available to us. The actual 
throughput of such a processor when connected to memory units and in the 
presence of user jobs is not easy to find. We will not study issues of 
type (d) in connection with the performance of asynchronous computer 
systems, but will concern ourselves with finding an index of performance 
which we will call its information processing capacity or computation 


rate. 


Our approach will be to study an existing model for asynchronous sys- 
tems and explore in depth its applications to the modelling of various 
types of concurrent systems, including pipelined organizations in which 
parallelism may be present in each stage. This model, while adequate for 
describing the structure of asynchronous systems, does not contain in- 


formation regarding the speed of operation of system components or any 
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information about the statistical utilization of the parts of a system. 
We show how such information can be incorporated into the model. The 
Gidoakitee of this augmented model are studied, and a technique is given 
for analyzing the throughput rate of a large variety of asynchronous 
processing or computing systems. The model we have developed and the 
analysis techniques for it were motivated by the desire to study perfor- 
mance issues in asynchronous computer systems. However, the work can be 
applied with equal facility to analyzing numerous systems outside the 
realm of computer systems; the best example is that of an assembly line, 
and we shall interchangeably talk about pipelined processing systems or 
assembly lines, because both share fundamental characteristics such as 
overlapped and parallel operation. In the next section, we pursue the 
modelling of asynchronous systems in some depth, and illustrate the 


spirit and flavor of the thesis by concrete examples. 


Py 


1.2 Modelling Asynchronous Systems 


Our study of various types of asynchronous concurrent systems has led 
us to the conclusion that, by and large, we would like to distinguish 
between two broad classes of systems: 

a) deterministic 
b) non-deterministic (i.e., having decisions). 


Let us explain briefly what we mean by this distinction. 


Deterministic Systems: We will call an asynchronous system deterministic 


if during the course of operation of the system, there is never a 
situation when a decision has to be made between alternative courses of 
action in the system. An example will explain what we mean. Consider 

an automobile assembly line in which only one kind of car is being manu- 
factured, and each car is made from similar components using the same 
sequence of assembly operations. Thus, in the course of operation of 
this system every assembly operation is needed for the assembly of each 
car, there being no difference in the sequence of steps needed for the 
manufacture of the fourth or eighth car output by the assembly line. To 
give another example, consider a pipelined floating-point adder, arranged 
in, say, three stages (see Fig. 1.2.1). Each stage performs a certain 
operation on the pair of operands input to the pipeline for addition. 
Thus, each operand pair that is absorbed at the input goes through the same 
sequence of operations before being output as a result. Such a pipelined 
system is also a deterministic system because no choice has to be made 
between alternative courses of action. So far we have looked at systems 
in which the objects being assembled (or added, etc.) go through an i- 


dentical sequence of operations. A more general class of systems consists 
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mk 


of those systems in which the objects being processed do not necessarily 
go through the same sequence of doweatians: but the operation of the 
system is still deterministic. An example of such a system is an assembly 
line for the manufacture of automobiles which alternately turns out two 
kinds of autos, one fitted with a 200 HP engine and the qther with a 350 
HP engine. We shalt saa that the car with the mare powerful engine has 
to be suitably braced and fitted with extra gadgetry, so that after the 
engine has been bolted to the chassis, subsequent assembly operations per- 
formed on the two cars are different. Thus, even though there is a point 
in the system at which a decision has to be made, this decision cannot, 
for example, be made arbitrarily by an assembly line operator. The opera- 
tion of the system is, so to speak, “preordained,"“ i.e., there is no point 
during the operation of the system at which a choice can be exercised be- 
tween alternative courses of action. 

Let us now move on to the class of systems we have termed non-determin- 


istic systems. 


Non-Deterministic Systems: We will loosely define nom-deterministic 
systems as those in which there does exist a choice between alternative 
courses of action. An example of such a system is the assembly line of 
Figure 1.2.2, minus the restriction that the two types of automobiles be 
manufactured alternately. Thus, at the bay at which alternate routing 
of the two types of partially assembled autos was done, this routing can 
now be made random or controlled by some decision process other than a 
purely deterministic one. The decision could, for example, be made on 
the basis of up-to-the-minute customer orders received from car dealers. 


The operation of an assembly line such as this is said to be non-determin- 


istic. Let us now give a more computer-related example. Consider the 
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simplified model of a pipeline processor shown in figure 1.2.3. 


Processing for Type A Out put 
Queues 


Input Stage 1 Stage 2 A 
Queue 


of instruction? 


Figure 1.2.3. A Simplified Model of a Pipeline Processor for a Computer 
with Three Instruction Types. 


We shall assume that the instruction set of the computer has three types 
of instructions, termed types A, B, and C. Instructions are fetched at 
Stage 1, decoded at Stage 2 and processed according to their type at 
Stages 3 and 4. The reader will notice that we have chosen the same nota- 
tion for representing both the assembly line of figure 1.2.2. and the 


pipelined processor of figure 1.2.3. Both these types of systems, 
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together with a large variety of others, all have the basic characteristic 
that they consist of a set of parts eGsather with some mechanism for coor- 
dinating their operation... In addition, the system parts could be 
arranged to exhibit overlapped operation and parallelism. Let us ex- 
plain these two terms in the context of Figure 1.2.3. Each stage of the 
system is a part, whose operation can be overlapped with that of the 

other parts. We will say that the parts of the pipeline processor operate 
concurrently or in an overlapped fashion. We now come to parallelism. 
Suppose Stage 1 consists of two identical hardware units, each of which 
can independently fetch an instruction, and the two can operate concur- 
rently with respect to each other. Stage I will be said to have paral- 
lelism of degree 2. In general, any stage is said to have parallelism 


of degree n if it has n physical hardware units available for processing. 


Brief Statement of Thesis Problem: The problem we have addressed in 
this thesis is to study in some depth how to model asynchronous, concur- 
rent systems such as the ones shown above. We have chosen a formalism 
known as Petri nets [Hl1, H2] to express the sequencing relationships be- 
tween events in asynchronous systems. The problem with Petri nets, as 
with other models for asynchronous systems or parallel computation that 
we could have used is that they represent only the sequencing or "cause- 
and-effect" relationships between events in a system. Such a systems 
description is not adequate if we wish to study performance issues. 

For example, the assembly lines and the pipeline sceceiasie modelled in 
Figures 1.2.1. through 1.2.3. are all real world systems built from com- 
ponents or devices which take a certain amount of time to operate. Thus, 
the production of cars or the processing of instructions in an actual sys- 


tem is not instantaneous. The rate at which processed objects appear at 


wGe 


the output depends upon several factors, some of which we outlined in 


section 1.1 in connection with an asynchronous pipelined processor. 
Recapitulating, the throughput rate of an asynchronous processing system 
will depend upon two kinds of factors: 

(a) factors intrinsic to the system, like its structure, the 

organization of its parts and the speed of operation of each part. 
(b) extrinsic factors, like availability of items in the input queues, 
statistics on the type of items input to the system for processing. 

The only extrinsic factor we will model will be statistics on the out- 
come of decisions during the course of operation of the system. In 
connection with Figure 1.2.3, this would mean the relative probabilities 
of occurrence of each of the instruction types A, B and C during the 
operation of the pipelined processor. As the reader will note, this 
is equivalent to the relative frequency of use of the system parts which 
perform the processing of these instruction types. We will not model 
other extrinsic factors, like arrival statistics for items in the input 
queves. The throughput rate that we thus calculate for a processing 
system will represent the maximum rate possible, assuming that it is 
connected to a balanced configuration of primary and secondary memory. 

In the next section, we look at previous work in the areas of asyn- 
chronous systems, parallel computation and project scheduling which is 


relevant to the research presented in this thesis. 
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Considerable work has been done in modelling the sequencing or control 
aspects of asynchronous systems, but few workers have actually considered 
issues of timing or speed of operation. 

The work of R. Karp and R. Miller [Kl1, K2] is concerned with the 
development of a mathematical model called parallel program schemata 
to represent parallel algorithms. A set of uninterpreted operations is 
defined over a finite set of memory cells. With each operation is 
associated two subsets of the memory cells called the domain and range 
cells. Upon activation, called an initiation, an operation reads the 
current values in its domain cells and evaluates a function with these 
values as arguments. An unspecified time later, the results are deposited 
in the range locations, the latter action being called a termination. 
Control is conceptualized as a possibly infinite directed graph consisting 
of a set of control states (nodes) together with a transition function 
that specifies for each state initiation and termination pair the suc- 
ceeding control state. A computation is a sequence of initiations and 
terminations that corresponds to a defined path in the control graph 
(nalinieb ini from a designated starting state) and that satisfy other rules. 
With this formulation, parallel activation of operations is possible, 
but their work on the whole is more concerned with control structures 
for parallel programs that properly terminate than with the issues in- 
volved in being able to represent continuously operating asynchronous 
concurrent systems. 

Dijkstra [D4] considers a method by which asynchronous sequential 
processes may operate concurrently and communicate harmoniously. The 


processes are provided access to common integer values called semaphores. 
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The semaphores can be manipulated by means of two synchronizing primitives, 
the "P" and "V" operations which decrement and increment, respectively, 
the value of a semaphore by one. The P operation can be executed only 
when the current value of a semaphore is greater than zero. Conditions 
are investigated for an ensemble of interconnected sequential processes 
to operate without being deadlocked. 

The work of Holt and: Commoner[H2,C1] is very significant and has: forn- 
ed the starting point of our thesis. Their model, called Petri nets, 
is very simple to understand, and consists of two types of nodes termed 
places and transitions. A set of directed arcs connects places to 
transitions or transitions to places. Markers, called "tokens" are put 
‘on places, and each token that is put on a place corresponds to the 
holding of one instance of the condition corresponding to that place. 
When every input place of a transition has a token on it, it can "fire." 
A firing causes one token to be removed from each input place, and a 
token to be added to each output place. Commoner has investigated 
conditions for a subclass of Petri nets to operate in a deadlock-free 
manner. Hack has done an extensive investigation of a subclass of 
Petri nets termed Free Choice Petri Nets [H1]. 

So far, we have discussed models for asynchronous systems in which 
no mention was made of real time. Two bodies of work exist in which 
real time issues are entered into for asynchronous concurrent systems. 

The first is the PERT network used in project scheduling [F1]. A 
PERT network is an acyclic directed graph with an input vertex and an 
output vertex. Ali arcs in the system lie on paths from the input vertex 
to the output vertex. Each arc denotes an activity in a project, and a 


method is given for calculating the shortest amount of time that it 
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takes to complete the project (the critical path), and also the earliest 
and latest time that any given activity can initiate in order that the 
project be completed in the shortest time poepibies While PERT networks 
can explicitly represent concurrent activity in a system, they do not 
have the power to represent systems which operate in a recurrent or 
repetitive fashion. Nor, for that matter, can they explicitly represent 
system components with parallelism within each component. Most important, 
there is no provision in the structure of PERT networks for representing 
a choice between alternative actions in a system. This makes the model 
inadequate for modelling the complex asynchronous systemg we would like 
to handle. 

Martin and Estrin at UCLA have studied a model of parallel computation 
called the program flowchart [B1, Ml, M2, M3]. Program flowcharts are | 
directed graphs consisting of nodes and arcs. Nodes represent operations 
in a computer program and arcs represent data paths between them. An 
operation can take place when some suitable logical combination (and, 
exclusive-or) of its incoming arcs have data values on them. When an 
operation takes place, it absorbs data values from its incoming arcs and 
puts data values on some logical combination of outgoing arcs. The model 
is extremely powerful and can represent decisions explicitly. Martin and 
Estrin are concerned with modelling parallel computer programs, with a 
view to finding their mean computation time, i.e., the amount of time it 
takes to execute a computer program from start to finish, For this pur- 
pose, they assume. each node to have a fixed (deterministic) time duration. 
A technique is given for finding the mean execution. time of well-formed 
program flowcharts. Although the model is capable of representing 


recurrent processes, Martin and Estrin have confined themselves, in their 
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analysis, to models of processes which operate in a "one shot" fashion. 
Also, they have not developed their model to a point where it can be 

used for modelling complex pipeline processors or assembly lines. We 
will show in the course of our work the added generality that is possible 


withour approach to modelling asynchronous systems. 
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1.4 Overview of Following Chapters 


1) It is shown how asynchronous concurrent systems can be looked upon 

as an ensemble of suitably interconnected finite state machines. Petri 

nets are introduced to formalize this notion, and also as a graphical 

tool for visualizing the structure of asynchronous systems. It is shown 

what it means for an asynchronous system to be well-formed in terms of 

the corresponding notions of liveness and boundedness for Petri nets. The 
two classes of asynchronous systems, deterministic and non-deterministic, 

are formalized. Boundedness is shown to be a decidable property for 

Petri nets, and liveness in turn is shown decidable for bounded nets. 

(Chapter 2). 

2) An inedepth discussion is given of the structure of asynchronous concurrent 
systems in terms of SMD (State Machine Decomposable) Petri nets. The 

notion of a "current assignment" is introduced for transitions in a Petri net, 
and a consistent Petri net is defined as one which can support a consistent current 
assignment. Subclasses of Petri nets are introduced, chiefly Event 

Graphs, LSP (Live, Safe, Persistent) Petri nets and SMA (State Machine 
Allocatable ) Petri nets. Deterministic and non-deterministic systems are 
studied in terms of these subclasses. (Chapter 3). 

3) Timed Petri nets are introduced. These are Petri nets in which a 
transition executes for a fixed non-zero time called its firing time. 

The maximum rate at which an event occurs in a system is its computation 
rate. The computation rate of a large class of deterministic systems 

is found. (Chapter 4). 

4) The general problem of finding the computation rate of non-deterministic 


systems is addressed. A bound is obtained for the computation rate of 


non-deterministic systems. Systems are identified for which this bound 


299% 


is achievable. (Chapter 5). 

5) Applications of our work to the modelling and analysis of real-world 
systems are given. (Chapter 6). 

6) Unsolved problems are given, together with recommendations for future 


research. (Chapter 7). 


CHAPTER 2 


PETRI NETS AND VECTOR ADDITION SYSTEMS 


2.1 ‘Informal Introduction to Petri Nets 


We will present the idea of the Petri Net model as an extension of 
the finite state machine model of conventional automata theory. The 
latter model is based on the assumption that we can abstract from a sys- 
tem a total system state, and the action of the system is described by 
the set of all allowable states and a set of rules that governs the 
transition of the system from one state to another. Let us give a simple 
example here to explain what we mean. 


Consider a erases of che spipetined Floaeing. Point aiden shown in 


Figure 2. 1. 1a). Assume that it can handle one pair of operands at a time 


and that tien it is parroreing an cudeue iow. on an operand pair, other 
operand pairs in its input queues must wait until it is through with the 
current operand pair. We can thus represent the operation of the stage 


of the pipelined adder as shown in Figure 2.1.1(a). 


Unit busy 


Ready to output result 


Figure 2.1.1(a) 
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The stage can be in one of three states, denoted by circles in Figure 
2.1.1(a) : 

(a) Ready for input operand pair. 

(b) Unit busy (processing an operand pair). 

(c) Ready to output result. 

Figure 2.1.1(a) is drawn using the notation of finite-state automata 
theory, and a transition from one state to another is denoted by an arc. 
For reasons that will become clear soon, we will insert into each arc in 
Figure 2.1.1(a) a bar to denote the transition from one state to another. 
Also we would like to indicate what state a system is in at any given 
instant of time. We will designate this by the presence of a marker on 


the circle corresponding to that state (see Figure 2.1.1(b)). 


Unit busy Done with processing 


Ready to output 
result 


Output result 


Ready for input 
operand pair 


Figure 2.1.1(b) 


The system shown in Figure 2.1.1(b) is im the state : "ready for 
input operand pair". When the system absorbs an input operand pair, it 


makes a transition to the state : "unit busy". Finally, when it is done. 


with processing the operand pair, it becomes ready to output the result 


and makes a transition to the state entitled "reddy to output result". 
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The system then outputs the result and becomes ready for the next input 
operand pair. This sequence of operations repeats. 

Each stage of the system can be modelled in this fashion. We now con- 
sider how to model a two-stage pipelined adder, given that each stage is 
modelled in this fashion. Once again, by using techniques of conventional 
automata theery, we see that the two-stage system can be represented by 
a finite state machine which is the cross-product of the two machines[H4]. 
Let us attempt to carry out this construction. We will assume that the 
states of the two machines are labelled a,b,c and a‘',b',c', respectively 


(see Figure 2.1.2). 
2 b' 


a 


Figure 2.1.2 


Figure 2.1.3 shows the resulting finite state machine. We see that it has 
nine states and a total of eighteen transitions between states. The 
problemg with this representation are the following: 

(a) The number of states grows as the product of the number of states 

in the individual stages. 


(b) The identity of the individual stages has been lost. For all we know, 
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this could be the state diagram of a nine-state system consisting of only 
one system component (or stage). 

The nine-state finite state machine apeeites the structure of our two- 
stage pipelined adder. We note that if the pipeline has n _ stages, the 
number of states in the finite-state machine used to represent it would 
be 3° ! To represent a five-stage pipeline, we would need 243 states. 
We clearly need a formalism which provédes more economical descriptions 
of concurrent systems. 

Bet us briefly discuss parallelism in a system like a pipelined adder. 
Suppose we consider a two-stage pipelined adder in which the first stage 
has five functional units in parallel, and can thereby support up to 
five concurrent computations. The second stage will be assumed to have 
eight parallel stages. We can represent parallelism of degree five in 
the first stage by placing a total of five tokens on the finite-state 
machine used to represent it. Similarly, we place eight tokens on the 
finite state machine used to represent the second stage. (See Figure 2.1.4). 
Each token is assumed to move from one state to another independently 
of all other tokens. The state of the first stage is now a vector 
( n(a), n(b), n(c) ), where each element of the vector represents the 
number of tokens on the corresponding place. Note that : 


n(a) + n(b) + n(c) 


{il 
) 


and 


n(a') + n(b') + n(c*) 


i] 
foe) 


In order to model the first stage as a finite state machine, 3° states 
are needed. Similarly, 38 states are needed for the second stage, giving 
t 


3° states in all for the cross product machine ! Suppose instead of 


attempting to model the above system as a finite state machine, we use 
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t 
fs 


Figure 2.1.3 Finite State Machine Model of Pipelined Adder 


the following artifiee : coalesce t, and ti together, givimg the dia- 


gram in Figure 2.1.5. The coalesced bar is relabelled 't'. 


The output terminal of stage 1 is thus made the input terminal of stage 

2. A transfer of an operand can take place when stage 1 is ready to out- 
put a result and stage 2 is ready for an input operand. In terms of the 

net in Figure 2.1.5, this is true when state c in stage 1 and state a’ in 
stage 2 each have at least one marker on them. Figure 2.1.6 shows a confi-— 
guration of the pipeline in.which an operand pair. can be transferred between 


the stages. Figure 2.1.7 shows the configuration that results after an 
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operand pair has been transferred. 


We are now ready to introduce some nomenclature. Each circle in the 
diagram of Figure 2.1.5 is called a place , and the bars are called trans- 
itions. The act of transferring an operand pair between stages was 


achieved by an action called the firing of transition t. 


Figure 2.1.4. 


a Figure 2.1.5. a’ 


Figure 2.1.6 shows a configuration of the system in which transition t can 


be fired. When t fires, a token is removed from each input place and 


added to each output place. (gee Figure 2.1.7). In system terms, each input 


-29- 


Before firing transition t 


Figure 2.1.6 


After firing t 


Figure 2.1.7 
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place to a transition represents a pre-condition that has to be satisfied 
(or hold) in order that the action corresponding to the eransi tion may 
take place. If there are several tokens on a place, it represents several 
instances of the holding of the corresponding condition. Thus, when the 
action corresponding to t occurs, it causes one instance of each pre- 
condition to cease and one instance of each output condition or post- 
condition to begin holding. When the finite-state machines representing 
Stages 1 and 2 are not connected, the stages they model are said to be 
mutually independent. In Figure 2.1.5, the finite state machines are said 
to be interconnected. Many other terms can be used instead of interconnec- 
ted stages, for example, cooperating or mutually synchronized stages, but 
we will continue to use the term interconnected. 

What we have done in the last few paragraphs is to introduce a way of 
modelling, in an economical way, systems of interacting parts in which 
overlapped operation and parallelism are both present. The terminology we 
have used is part of the definition of Petri nets, which are discussed more 
formally in the next Section. We have introduced the idea of Petri nets 
as a natural extension to the finite state machine model, and their advan- 
tages are obvious and considerable. 

We would like to continue our informal approach in order to introduce 
some important notions that we will need in understanding the structure 
of asynchronous systems. Consider the finite-state machine in Figure 2.1.1(b). 
The state machine is strongly-connected, i.e., from every state there 
exists a directed path to every other state. The state machine in Figure 
2.1.8 is not strongly-connected and we see that there is the possibility 


that the token can appear in place P3> after which there is no way for the 


ais 


token to appear in either place P, OF place Po: What this implies is that 


once the token appears in place P4° transitions t,t, and t, eannot fire. 


Figure 2.1.8 A finite state machine that is not strongly-connected . 


. 


We have seen that transitions can be used to represent operators in an actual 
system. In the systems we are interested in, we insist that every operator 
always be usable. A system component those operation can be represented by 


the finite state machine of Figure 2.1.8 has certain operators (represented 


by tpt, and t,) which are not used after some transient behavior of the 
system. In the steady state, transitions t, and t. fire alternately, over 


and over again. We will insist that each finite state machine be strongly- 
connected, and the reader can see that this is necessary for the composite 
system to satisfy the requirement that every operator always be usable in 
the course of operation of the system. If it so happens that some of the 
operators can never be used (i.e. their corresponding transitions can 


never be fired) , then those operators can be removed from the system 


without affecting its operation. Such operators are termed redundant. 
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The reader can verery that in the resulting net, pean terone t.> - and 
te are redundant. 

A Petri net such as the one shown in Figure 2.1.9(b) has no redundant 
transitions and will be ereaa live. On the other hand, in the net shown 


in Figure 2.1.9(c), transitions t_, t. and te are redundant, whereas all 


5° 6 
others can always be fired. Such a net is termed pseudo-live. In some 
Petri nets, the operation of the net may lead to a configuration in which 


no transition can be fired. Such a net is termed non-live. In Figure 


2.1.10(b) we show the construction of such a net from the two state machines 


of Figure 2.1.10(a). 


Figure 2.1.9(a) 
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Figure 2.1.9(c) 


In the configuration shown in Figure 2.1.10(b), either transition t, or ty 


can be fired. If t, is fired, the net attains a configuration in which no 


2 
further transitions can be fired (see Figure 2.1.10(c)). The net is an 
example of a non-live net, and the configuration shown in Figure 2.1.10(c) 
is a deadlocked configuration. 


What we have done so far is to motivate a formal study of Petri Nets, 


which we now proceed to do én the next section. 


Figure 2.1.10(a) 


Figure 2.1.10(b) Figure 2.1.10(c) 
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2.2 Formal Definition of Petri Nets and Relevant Concepts 


2.2.1 Petri Nets 


We are now in a position to introduce the reader to the formal defini- 
tion of Petri nets together with the related terainology that we will use 


in the rest of this thesis. 


Definition 2.2.1 A Petri Net Pis a three-tuple (P,T,A) 
where P is a non-empty set of distinctly-labelled places {p1, p2»---Pn!- 
T is a non-empty set of distinctly-labelled transit ions 
Lt eae eectas 
A is a relation; it corresponds to a set of ares, where each arc 
is either from a place to a transition or from a transition to 
‘pide : 


AS PXT U TXp, 


Definition 2.2.2 A marking M is a function such that M: P —# 1], where 7] 
is the set of non-negative integers. The non-negative integer associated 
with a place represents the token load of that place, or the number of 


tokens on it. 


A Petri net with a marking will be referred to as a marked Petri net 


(see figure 2.2.1 for an example). 


Notation and Terminology 


A node of a Petri net is either a place or a transition. We now intro- 
duce a convenient notation for the predecessor or successor nodes of any 


node in a Petri net. We will refer to it as the dot notation. 
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Figure 2.2.1. A marked Petri Net 


{p,t) 6 A is written p-t. 
{y]x.-y} is written x°*. 


{yly-x} is written °x. 


Example: In figure 2.2.1, -t2 = {p1, p2}- 


Ppi° = {t,, to}. 


The dot notation is also applied to designate the successor or pre- 
decessor set of a set of places or transitions. Thus, 
-{ta, t,} 7 {p1, P2> p3}. 


{p3, po} = {to, tz, t,}. 


A transition t in a Petri Net Pis said to be enabled iff every input 
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place P, € -t has at least one token on it. An enabled transition can be 

fired. When a transition fires, a token is removed from each input place 

and added to each output place pj € t+. Suppose M@ is the marking that 

results when an enabled transition t, fires at marking M. We write this 
vw £5 we, 

Now suppose transition t}, can be fired at marking M@. Let MD be the 

marking that results when transition t}, fires. We write 


M fa, ma Eby ub, Continuing in this fashion, let | 
us suppose that at every new marking that results when a transition fires, 


at least one transition can be fired. We can write this 
wm taema Eby eb -. ©... Ee yn, 


The sequence of transition firings t tpt, ..... tn is termed a firing 
sequence. If o= t,tp ......tn, we write 


n. 


t,t a 
iM a“b... 4 


n 
or 4 —2.—-e . 


Example. 


In -Figure 2.2.2(a) transition ty is enabled. Figure 2.2.2(b) shows 
the marking that results when transition to is fired at marking M. The 


reader can also verify that tot,t,5tyt3 is a firing sequence for the net. 


Definition 2.2.3: A marking ud is said to be reachable from marking ut 


if there exists a firing sequence ©. such that 


a ee, 
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Marking M Marking M' 
(a) (b) 


Figure 2.2.2, 


Definition 2.2.4: The forward marking class M of a marking M is the 


set of markings that is reachable from M. i.e., 
M= {0 | ao € * and Mm Spm}. 


Note: T* is the set of strings on the alphabet 


t= {t,, to, er re 


Notation: We have defined M as a function that assigns a token load to 
every place py € P. M can also be looked upon as a vector, theith element 


of which corresponds to the token load of the ith- place py in the Petri net. 
Example: The marking M of the net in Figure 2.2.2(d) can be written as 


M= (1, 1, 6, 0) 
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The forward marking class of M is seen to be 


M = {(1,1,0,0), (0,0,1,1), (0,1,1,0), (1,0,0,1)}. 


————— A cr 


Definition 2.2.5: A marking Mis live for a transition t iff for every 
< > 
marking mM! in the forward marking class M there exists a firing sequence 


which fires t. 


for every transition in the net. 


Definition 2.2.7: A marking M is bounded for a place p, iff there exists 


an integer N such that for every marking mic M, Mi(p) s N. If N= 1, the 


marking is safe for place p. 


Definition 2.2.8: A marking M is bounded (or safe) for the Petri net Pp 


iff M is bounded (or safe) for every place in the net. 


In Section 2.1, we pointed out that there exist marked Petri nets in 
which a marking is reached in. which no transitions can be fired, i.e., the 
net can reach a deadlocked configuration. Suppose this deadlocked con- 
figuration is called ul, and suppose that the net is in some configuration 


(i.e., has a marking) M. If qM—ayl then o is said to be a killing 


sequence for the net at the given marking M. A net. with a live marking 
has no killing sequence. 


There is one more important issue we would like to consider in this 
subsection. In Section 2.1, we pointed out that the only Petri nets we 
will use for modelling asynchronous concurrent systems will be nets which 
have a live marking. We would also like the marking for a Petri net to 


be bounded. This means that the only Petri nets we would like to consider 
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are strongly-connected nets. Let us see why. Consider a non-strongly 
connected Petri-net f, Then, there must exist in the net two portions of 
the net A and A* such that all arcs between them are directed from A to 
A’, as shown in Figure 2.2.3. Each of the nodes a and c in A can either 
be a place or a transition. Suppose a is a place. Then b must be a 
transition. Since the marking for the net is live, it means that b can 

be fired repeatedly. But each firing of transition b removes a token from 
place a. Now suppose we do not fire transition b at all. This would mean 
that all the tokens which were Seat oseis being removed by firings of b 
can now stay on place a, which means that the number of tokens on a can 


become unbounded. 


Figure 2.2.3 A Petri net that is not strongly-connected. 


Now suppose that a is a transition and b is a place. Since the marking 
for Pis live, we can fire transition a repeatedly, causing the token 


content of b to become unbounded. Thus, if ? is not strongly-connected, | 
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the token content of some place in p becomes unbounded. Now, since we 


are concerned only with nets which have a live, bounded marking, ail 


Petri nets we will consider will be strongly-connected, unless explicit - 
ly stated otherwise. 
In Section 1.2, we said that there are two broad classes of systems 
that we would like to distinguish between. We now pursue that line of 


thinking formally. 
2.2.2 Deterministic and Non-Deterministic Petri Nets 


We would like to make an important distinction between two classes of 
Petri nets, which we term deterministic and non-deterministic. First, we 


look at a structural condition of Petri nets known as conflict. 


Definition: Consider a Petri net P = (P,T,A). Two or more transitions 
t1,to, ......t, are said to be in conflict if there exists a place p such 
that Pre Pet2.caeee Ap-tk. . 


Note: 'A' denotes the logical "and" operaton. 


“Figure 2.2.4 
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In Figure 2.2.4 transitions t, and t, are in conflict at place p. 


1 2 
With the marking shown, a token in place p, can be removed by the firing 


1 or transition t,- Thus, for the given marking, 


we have a choice between firing either transition t or transition ty» 


and when either transition fires, the other ceases to be enabled. Before 


of either transition t 


we can go any further, we must make a few definitions. 


Definition 2.2.9: A marking M is persistent for a transition t in a 
Petri net Pif t has the property that once it is enabled at any markiug 
ul ¢ M, it cannot cease to be enabled by the firing of any transition 


other than itself. 


Definition 2.2.10: A marking M is persistent for a Petri net FP ite it 


is persistent for every transition t € 2, 


The net in figure 2.2.4 has the property that it has no marking which is 
persistent for transitions ty and t,. Such a net is termed.ia non-deter- 
ministic net, and represents a system in which there is a distinct choice 
between alternative actions (see section 1.2). In contrast, the net in 
figure 2.2.6 is a deterministic net or a net with a persistent marking. 

We now introduce some convenient ways of referring to the nets we have 
been discussing. 

A net with a Live, Bounded or a Live, Safe marking will be termed an 
LB or LS net, respectively. If a Live, Bounded marking is also Persistent 
we will call the Petri net an LBP net. A net with a Live, Safe, Persistent 


marking will be called an LSP net. The net in Figure 2.2.5 is an LSP net. 


Petri nets are a graphical representation of a mathematical system 
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‘Figure .2.2.5 A Petri net with a Hive, safe, persistent marking 


known as vector addition systems. In the next section, we use some results 
in vector addition systems to prove the decidability of certain issues in — 


Petri nets. 


2.3 Vector Addition Systems and their Relationship to Petri Nets 


In this section, we give a brief description of vector addition systems, 


but a more thorough coverage can be found in Karp and Miller [K2]. 


Definition 2.3.1: An r-dimensional vector addition system V is a pair 
V = (s,W) where 

() s€N, ne (0,1, ...}. 

(2) Wis a finite set of r-dimensional integer vectors 
W= {wpr-eesm} , w, € (041,42, ...}7 

The reachability set R(V) is the set of vectors given by 

+w 


R(V) = CAE =s+w +... + Wie where w,, € W. 


il ij 


EW, j= 1,2,.,0k. 


i2 

ae “ij 
b. stwiyptwiyt.. tw e 0, k = 1,2,...t. 

The reachability set is the set of all points that can be reached by some 


path from s using vectors from W and never leaving the first orthant. 


Example 2.3.1: As an example of a four element vector addition system, 
consider 
Vi= (3,w) 
where s = (1,1,0,0) 
We {w, = (-1,0,+1,0), wo = (-1,-1,+1,+1), W3 = (0,-1,0,+1), 
w, = (41,0,-1,0), w, = (0,+1,0,-1)}. 
The reachability set R(V) of this vector addition system consists of 


four vectors {(1,1,0,0), (0,0,1,1), (0,1,1,0), (1,0,0,1)}. 


Notation 


Let us denote the vectors in the reachability set by Vy? 


245% 


i.e., R(V) = ty,» Yor see Yale 
Also, if yy is a vector, then (ys), will denote the niga element of Y;° 
We will draw a directed graph with the elements of R(V) as vertices, 
and an arc from vertex Ys to y; if there exists a vector Wy € W such that 
y; ge st + Wy° 
Each arc will be labelled with its corresponding vector we above, and the 
resulting diagram shows at a glance the vectors in the reachability set 
and: how they can-be reached from one another. We’will-call-this diagram 
the reachability diagram of the vector addition system V. 
As an example, we give the reachability diagram corresponding to the 


vector addition system of example 2.3.1. 


(0,0,1,1) 


@ (1,0,0,1) 


Figure 2.3.1 The Reachability Diagram for the Vector Addition System of 


Example 2.3.1 
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If the reachability set of a vector addition system is finite, the reacha- 
bility diagram is a finite-state machine, and it is practical to draw it as 
we did for our example. If, on the other hand, the reachability set is 
infinite, we must find an alternative representational tool. For this 
purpose, we introduce the following terminology: 

(1) We define a quantity such that if n is an integer then n < &, and 
n+4= WW, The quantity © essentially represents infinity and we discuss 
its use shortly. 

(2) A rooted tree is a directed graph such that one node (the Foot node) 
has no arcs directed into it, each other node has exactly one arc directed 
into it and each node lies on a directed path from the root node. If 8 and 
y are distinct nodes of a rooted tree and there is a directed path from B to 
yY, we write B< y. If there is an arc directed from B to y, then y is a 
successor of B. A node with no successors is called a leaf. With an r- 
dimensional vector addition system V = (s,W), we shall associate a rooted 
tree T(V). Each node B € T(V) is labelled with an r-dimensional vector £(B), 
where each element of £(B) belongs to the set N {w}. A recursive procedure is 
given below for the construction of T(V). Note that the label for a node 
is assigned when the node is added to the tree. 

(1) Create a root node and label it s. 

(2) Let B be a node in the partially created tree with no successors. 

If, for some node y, y < B and £(y) = £(B), then B is a leaf in T(V). Other- 
wise, for each w € W such that w+ £(B) 0, add a node B. and make it the 
successor of 8. For each 8. the ith coordinate of £(B ) is assigned as 


follows: 
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(i) If there exists y < B such that £(y) < £(B) + w and 
Ay), < (£(B) + ws then L(B); = WW, 
(ii) If no such y exists, then LB), = (£(B) + Wie 
(3) Repeat step (2) until no new nodes can be added to the tree. 
We show in Appendix I that for any vector addition system V, the tree 


T(V) is finite. 


Example 2.3i2. As an example (taken from Karp and Miller), consider the 


following vector addition aystem V = (s,W). 


s = (1,0,0,0,0) 


W 


{(-1,1,0,0,0), (-1,0,0,1,0), (0,-1,2,0,0), (0,1,-1,0,0), (0,0,0,-1,2) 
(0,0,0,1,-1)}. 


The rooted tree T(V) is : Yi coal 


\ 
(0,0,0,0,0) 


(1,0,0,0,0) (0,0,0,0,0)—>(0,0,0,w,w) 
Noson,erei.s.r.0.ttash 


(0,1,0,0,0)-»(0,0;2,0,0)-»(0,1,0,0,0 (0,w,w,0,0) 


(0,0,0,0,0)-»(0,0,0,0,w) 


(0,0,0,0,w) 
Figure 2.3.2 


All our decidable results about vector addition systeme reduce to inspecting 


the tree T(V) and using the following theorem. 


Theorem 2.3.1: For any vector addition syatem Vand any integer vector x 
of the same dimension 


(dy € R(V) such that x < y) @ (FB € T(V) such that x < £(B)). 
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Proof: Given in Appendix I. 
Corollaries 


The following questions are decidable: 


Corollary 2.3.1.1 
Vy € RCV) is (y); <n for some finite n? 


Corollary 2.3.1.2 


Vy € R(V), is a given set of elements of the vector y simultaneously 


bounded? 


Corollary 2.3.1.3 
Is R(V) a finite set? 


In addition, if the questions in Corollaries 2.3.1.1 or 2.3.1.2: are true, 
the bounds can be effectively computed. For the vector addition system 
in Example 2.3.2, we see that V y € R(V), there are elements which be- 
come unbounded, and hence R(V) is infinite. Furthermore, the elements 


that can become simultaneously unbounded are (3), (5), (2,3) or (4,5). 


Petri Nets and Vector Addition Systems 


It is easy to see that for every marked Petri net there is a correspon- 
ding vector addition system. Let us explore this issue in detail. 

Let P be a Petri net with initial marking M. We can construct an 
equivalent vector addition system V = (s,W) as follows: 

The dimension of the vector addition system is made equal to the 


number of places in the Petri net, and the starting vector s is taken to 
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be the initial marking of the net. It now remains for us to show how 
to construct the set of vectors W. Consider a marking M' in the marking 


> 
class M, at which a transition t, is enabled. Since the firing of th 


h 
decrements the token content of each of its input places by one, and 
increments the token load of each of its output places by one, we can 
represent a firing of ue by adding a vector Yh to the: marking M' to yield 


the marking that results when transition t, is ‘fired. The elements of :the 
vector w, are given the following values: 

+1 if the corresponding place is an output place of th 
-1 if the corresponding place is an input place of th 
0 otherwise (we are assuming that there are no self-loops in P 3 


see figure 2.3.3). 


On| 


Figure 2.3.3. A Seif-loop 


We now assert that the vector addition system V is equivalent to the 
Petri net P with marking M in the following sense: 
> 
(a) For every marking xX € M, there exists a vector y € R(V) such 
that y = M.. 
” q 


(b) A vector addition sequence v is a sequence of vector additions 
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wy +... +W, such that it transforms a vector y, to vector y.. i.e., 


y —» y where 


We see that corresponding to every vector addition sequence v there 
exists a firing sequence 0 = — ceety in the Petri net P, and vice versa. 
We now establish some decidable results for Petri nets based on the 


decidability of the corresponding issues for Vector Addition Systems. 


Theorem 2.3.2 It is decidable if a given marking M for a Petri net Pp 


is bounded. 


Proof: Let ® be a Petri net and M its initial marking. By the construc- 
tion above, we can find an equivalent vector addition system V = (s,W). 
From Corollary 2.3.1.2, we know that the following is decidable: 

Given any vector y € R(V), is a given set of elements of y simul- 
taneously bounded? 


Thus, it its decidable if M is bounded for the Petri net P. 
We can now establish the following result: 


Theorem 2.3.3 It is decidable if a given marking for a Petri net is 


live and bounded (i.e., if a given Petri net is LB). 


Proof: Since we can decide if the given marking is bounded, we proceed 
to show that there exists a decision procedure to decide if a given 
bounded marking is live. For a bounded marking, the reachability set 


of the corresponding vector addition system V is finite. Hence, the 
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reachability diagram for V is finite. The marking M is live iff for 


> 
any marking M" € M there exists a firing sequence that fires every 
transition in the net. In terms of the reachability diagram, this 


means that starting at any vertex yy and given a vector wy € W, there 


must exist a vector addition sequence that contains “ 


ms 
f ~ 
¥ t 
j 1 
f J ! 
I t 
© f O f ~- * 
Ya 7 ‘ 
lA ‘ 
ly / 
( Fe 
~ Ps 
: Figure 2.3.4 


Consider figure 2.3.4. Find the set of vertices fy,» Yo» ---y} in 
the reachability diagram which have w, a8 an outgoing arc. We have to 
find if there exists a directed path from y, to at least one vertex in 
this set. Since the graph is finite, this can be done by exhaustion. 
The procedure is repeated for every vector ‘ € W, starting at the 
vertex y,- We now choose all the vertices in turn and carry out this 
test. The total number of tests involved is wl x Iecv) | where 

|w| = the number of vectors in the set W. 


Ircv) | = number of vectors in the reachability set of V. 


-52- 


Since both quantities are finite, the algorithm must terminate. This 
' proves the required result. s 


We now prove another theorem for a Petri net with a bounded marking. 


Theorem 2.3.4 It is decidable if a bounded marking M for a Petri net P 


is persistent. 


Proof: To check if a bounded marking is persistent, perform the following 


test for every marking in the marking class; 


> 
Let M' be a marking in the marking class M, and let {t!, ty, +04 be 
the set of enabled transitions. Let the firing of a transition 
t. 
kK 


ty € tty, oot} lead to the marking Ms i.e., M' = Mi. 
For the firing of each transition t,., we check to see if all other 
enabled transitions ty 7 ty are still enabled. If not, the marking M' 
is not persistent. If the marking M' is persistent for all of its 
enabled transitions then we check the next marking in the marking class, 
and so on until all markings M' eM have been exhausted. The marking 
M is persistent iff all markings M' € M are persistent for their enabled 
transitions. Since the total number of tests to be performed is finite 
and bounded, the algorithm must terminate. a 
This concludes our discussion of decidable issues in Petri nets. 
We have so far looked at Petri nets in terms of their markings and their 
marking class. In the next chapter, we look at Petri nets in terms of 


their structure. 


GHAPTER 3 


RELEVANT RESULTS FROM PETRI NET THEORY 


3.1 State Machine Decomposable Petri nets 
In Section 2.1, we saw how an asynchronous concurrent system can be 

viewed as an ensemble of interacting components, where each component is 
structurally similar to the finite-state machine of automata theory. Each 
component has a finite aaber of allowable states; since we are concerned 
with systems which have no redundant operators (see Section 2.1), the 
Petri nets of interest are LB. We will use a restricted class of Petri 
nets known as LB SMD (State Machine Decomposable) Petri nets to model 
asynchronous concurrent systems. In this Section, we formally define the 
notions of state machine and SMD Petri nets. 
Definition 3.1.1 A closed subnet Ii' of a Petri net FP is a strongly- 
connected Petri net (P*, T', A") where 

P' <P is a set of places, 

tT’ ST is a set of eae te tonat 

A’ © Ais a set of arcs, such that 

-P* = pts = f°, and 


A‘ = [ce' x T') U (CT' x PTD] AA, 


The Petri net N in Figure 3.1.1 has five closed subnets Nie Ny» N,> N, 
and N,. Clearly, every strongly-connected Petri net is a closed subnet 
of itself, because the relation °“P = P* = T is trivially satisfied. 
We are interested in identifying components or parts of a system that 


can be represented by a Petri net, and for this reason we would like to 


define a minimal structure which is part of a Petri net and is still a 
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Figure 3.1.1 
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(c) 


Figure 3.1.2 
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closed subnet. 

‘Definition 3.1.2 A closed subnet is a minimal closed subnet if and 
only if no closed subnet can be obtained by deleting any portion of it. 
Example 3.1.1 N is not a minimal closed subnet because by deleting 
appropriate places and/or transitions, the four closed subnets Ny> Ny» 
N, and N, are obtained. However, the closed subnets Ny» Ny» N, and N, 


are minimal closed subnets, as the reader can easily verify. 

Example 3.1.2 Consider the Petri net model of a pipeline processor 
shown in Figure 3.1.2(a). The closed subnets representing each indivi- 
dual stage are minimal closed subnets (see Figure 3.1.2(b) ).On the other 
hand, the closed subnet in Figure 3.1.2(c) is not minimal. All the Petri 
nets considered so far have the property that their minimal closed sub- 
ner have disjoint places. Lest the reader be under the impression that 
this is a requirement on minimal closed subnets, we would like to empha- 
size that this is not so. Consider the net in Figure 3.1.3(a). It has 


the minimal closed subnets S,, S,. and S$ 


1’ “2 3 
will now formally define the notion of state-machine based on a suitable 


shown in Figure 3.1.3(b). We 


structural restriction on Petri nets. 


Definition 3.1.3 A Petri net f is a state-machine if and only if every 


transition has exactly one input place and exactly one output place. 


Example 3.1.3 For an example of a state-machine, see Figure 3.1.4. 
Definition 3.1.4 A Petri net 2p is said to be covered by a set of closed 


subnets { n,, NL} if and only if 


grees 
DP «= CUP LUT YA). 


Example 3.1.4 The Petri net in Figure 3.1.2(a) is covered by the set 
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Figure 3.1.3 
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Figure 3.1.4 A State Machine 


of closed subnets {s,, S55 s,}. Similarly, the get in Figure 3.1.3(a) 
is covered by the subnets of Figure 3.1.3(b). 

Definition 3.1.3 A Petri net P is SMD (State Machine Decomposable) iff 
every minimal closed subnet is.a state acniae aaa thére exists a set of 


state machines {s,,---S,} which covers the net. 
Example 3.1.5 The Petri nets in Figures 3.1.2(a) and 3.1.3(b) are SMD. 


On the other hand, the net in Figure 3.1.1(a) is not SMD, because the 


3 and N, are not state machines. 


Definition 3.1.6 The token content of a Petri net {P, T, A) with a 


minimal closed subnets Ny> N 


marking M is the number of tokens on all places in the net, and is given 


YN) = EM(,) 
P,€ P 


Lemma 3.1.1 The token content of a marked state machine is invariant 


under transition firings. 


Proof Suppose we consider a state machine P= {P, T, A) with mar- 
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king M. The only way in which the token content N(p) of the state machine 
can change is by the firing of a transition. However, the firing of a 
transition removes exactly one token from the net and adds exactly one 
token to the net. Thus, the firing of a transition does not change the 


number of tokens on the net. 
ry 


Theorem 3.1.1 Every marking for an SMD Petri net is bounded. 

Proof An SMD Petri net has a finite number of state machine components. 
Let these components be S,>---S,- Also, by Lemma 3.1.1, the token content 
of each state machine is invariant. Let the token content of the nth state 
machine be N(P). Now, recall the definition of a bounded marking for a 
Petri net. We have to show that for each place p in the net, there exists 
an integer Z(p) such that M(p) < Z(p) for each M in x. 

Let S31°** aye be the set of state machines which contain place p. 
Then, if N(S),)>-- NCS.) are their respective token contents, we see 
that Z(p) < ainf[ N(S,)>---N(S DI. If Z(p) were greater than 
nin [ N(S)4)>+--N(S I, it would imply that there exists a state machi- 
ne s.. such that Z(p) > N(S)); which is impossible. Hence, for each 
place p in the net, we can find an integer Z(p) which bounds its token 


content. This proves the Theorem. 
a 


Corollary 3.1.1.1 It is decidable if any given marking M for an SMD 
Petri net Pis live. 

Proof By Theorem 2.3.3, it is decidable if a bounded marking M for 
Petri net Pis live. Also, since any given =o for P is bounded, 


the desired result follows immediately. 


Instruction type A 


© 
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Figure 3.1.5 SMD Petri net model of a pipeline processor with decisions 
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‘Figure 3.1.6 State Machine components of net in Figure 3.1.5 
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This result is useful because it tells us that we can decide if a given 
, ere M for a Petri net ? is live, but it does not give us an effec- 
tive procedure for constructing a live marking for P. We will look at 
this issue in Sections 3.3 and 3.5. Before we proceed to the next Sec- 
tion, we give the reader an example which illustrates the descriptive 
power of SMD Petri nets. The example of the pipeline processor we gave 
in Figure 3.1.2(a) was that of a deterministic system. In Figure 3.1.5, 
we give an SMD Petrinet model of the pipeline processor for the compu- 
ter system with three instruction types which we digcussed in Section 
1.2 (see Figure 1.2.3). Figure 3.1.6 shows the net in Figure 3.1.5 
split up into its state machine components, which are labelled Ss, throu- 


gh Ss The reader will note that the decision about processing an instr- 


9° 
uction after it has been decoded can be made in the state machine 83° 


Place p in state machine S, has three ‘output arcs, one for each instruc- 


3 
tion type. We said in Section 1.2 that in order to estimate the pro- 
cessing rate of such a pipelined processor, we must know the relative 
frequency of occurrence of each of the instruction types A, B and C. In 
other words, we must know the relative frequency of occurrence (or the 
probability) of each of the output transitions of place p. Thus, on the 
average, the rates at which each instruction type occurs will be reflec- 
ted in the number of times each of the transitions t ty and t. fires 
in the long run. We continue this train of thought in the next Section, 
where we present the important notion of consisteacy. In the meantime, 
the reader should explore the descriptive power of SMD Petri nets by 


constructing examples of his own. 


See ee ee Tae ne ee 
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3.2 Consistent Petri ets 

In this Section, we will look at the number of times any transition 
in an LB Petri net can fire relative to the other transitions in the 
net. We will first need some definitions. 
Definition 3.2.1 A current assignment for a Petri net P= (P,7,A) ts 
a function % that assigns to each transition t, € T a positive rational 
number ?; called its current. A current assignment for a Petri net 
must satisfy the following two constraints: 
(1) Every are carries a current equal to that associated with the transi- 
tion it is connected to. 
(2) At every place, the sum of the currents on the input arcs must equal 
the sum of the currents on the output arcs (i.e., Kirchoff's current law). 


. 


Definition 3.2.2 A Petri net is consistent iff it has a current assign- 


ment § with 9, > 0 for each t, a 


Checking the Congistency of a Petri net 


Figure 3.2.1 


-64- 


Consider the Petri net shown in Figure 3.2.1. Let each transition t, in 
the net be assigned a current 95° For each place, we can write an equa~ 


tion that specifies a constraint on the input and output currents. 


Py! -% == ® $9, = 0 .. () 
Po: +9 - P, = 0 ... (2) 
P33 +, - %3 = 0... (3) 
P,? + %5 <0, = O .e- (4) 


From Equations (2), (3) and (4) we get, 
9 = 9 = % = = @ (say). 
This violates Equation (1), which requires that 
% = FFT * 29. 
We conclude that the net in Figure 3.2.1 is not consistent, or is 
inconsistent. A slight modification of this net leads to the consistent 


net of Figure 3.2.2. 


Figure 3.2.2 
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The reader will note that ail we have done is to feed another current of 


magnitude @ into the place Py: The equations become 


Ppt - 6, 7 by se) a 
Py ? + %5 = P = 0 
ie + ¢, = * = 0 
Py 3 + 5 + $9, = 0 
Pp. + + ¢, eS te, eee 


These equations are, indeed, consistent and we get the following consis- 
tent current assignment: 

$, = $ = $3 = % = $5 = $ , where » is any 
non-zero real number. 

Consider a consistent current assignment for a Petri net P. Multi- 
ply all currents by the least common multiple of their denominators, and 
divide each resulting current by their greatest common divisor. The 
resulting currents are said to form a minimal integer consistent current 
assignment for P . The reader will note that consistency is a purely 
structural property of a Petri net. The following Theorem and discussion 
explain the relationship between the notion of consistency and the 
structure of LB Petri nets. The ideas in the following material have been 


partly inspired by Baker [B4]. 


Definition 3.2.3 Let # be a Petri net with marking M. A cyclic firing 
sequence is a firing sequence o which fires every transition of Pat 


least once and brings the marking of the net back to M. 
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Definition 3.2.4 The firing count of a transition or in a firing se- 
quence o is the number of occurrences of t, ino. The firing vector ¥ 
of a firing sequence o is a vector whose ith element yp(i) is the firing 


count of transition t. 


Theorem 3.2.1 A Petri net P is consistent if and only if there exists 


an initial marking M for which there exists a cyclic firing sequence. 


Proof: Necessity 

Consider the minimal integer.consistent.current .assignment. derived from 
the given current assignment. Let this: current- assignment be © . We will 
show how to construct a finite firing sequence o whose firing vector ¥ 


is such that (1) is the current through transition t We construct 


i’ 
the marking M as follows: M(p,) must be big enough so that firing its 


output transitions tre Py w(i) times does not drive the token load of 


Py negative, i.e., M(p,) = Sea : 


fie PK 
( 
Under this marking, a cyclic firing sequence o is given by 
yQ) w(n) 
= t eon t 
1 n 


where na are the transitions in the Petri net 7. Since 6 is a 
consistent current assignment, the firing sequence o is such that for any 
place Po the number of tokens removed by o is equal to the number of 
tokens added by o to Pye The marking M‘ after o has occurred is the 


same as the marking M before the occurrence of o , so that o is a cyclic 


firing sequence. 
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Sufficiency Let 9 be a cyclic firing sequence for the net, and let 


Y be the firing vector of o . With each transition t,, associate an 


4? 
integer valued current equal to §(i). By definition of current, each 
input and output arc of t. has a current (i) associated with it. Under 
this assignment, every transition has been assigned a non-zero integer 
current, and each transition has been assigned a unique current. Now 
consider any place Py: By the definition of a cyclic firing sequence, P, 
has the same number of tokens before and after the cyclic firing sequence 
has occurred. This implies that the sum of the tokens entering place P, 
is the same as the sum of the tokens leaving Pye i.e., the sum of the 
input currents equals the sum of the output currents for every place Py: 
This is the definition of a consistent current assignment. 
a 
We have pointed out in Chapter 2 that LB Petri nets are the only ones 


of interest to us. We now establish the connection between consistency 


and LB Petri nets. 


Theorem 3.2.2 A Petri net ? which has a live, bounded marking M is 
consistent. 

Proof We have to show the existence of a cyclic firing sequence which 
fires every transition at least once. Since the marking M is bounded, its 
reachability diagram is a finite-state machine. Alao, since M is live, 
there must exist a strongly-connected portion of the reachability diagram 
that contains every transition name at least once. Let M' be some node 


(marking) in the strongly-connected portion of the reachability diagram. 
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Figure 3.2.3 
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We know that in a strongly-connected graph, there exists a chain (i.e., 
a sequence of directed arcs, all arcs directed in the same sense) that 
starts and ends at M' and which goes through every arc once. This chain 
can be used to describe a cyclic firing sequence, and since Pf has a 
live marking, every transition name must be included at least once in 


this firing sequence. 


As a Corollary to Theorems 3.2.1 and 3.2.2, we have the following: 
Corollary 3.2.1 If a Petri net Pis LB, then there exists an initial 


marking M for which there exists a cyclic firing sequence. 


Proof Follows by combining Theorems 3.2.1 and 3.2.2. 


For completeness, we mention that the converse of Theorem 3.2.2 is not 
true. The net in Figure 3.2.3 provides the necessary counter-example. The 
reader can verify that any initial marking of the net can lead to a mar- 
king like the one shown in Figure 3.2.3(b), at which no transition is ~- 
enabled. 

We would like to consider the connection between consistency and the 
structure of LB SMD Petri nets, because that is the class of Petri nets 
we will use for modelling asynchronous systems. We will look at the 
concepts of cyclic firing sequence and consistency for SMD Petri nets 
in some depth in Section 3.5. Before we can do this, we look at a det- 
erministic subclass of Petri nets known as event graphs. We do this in 
Section 3.3. In Section 3.4, we complete our study of those aspects of 


Petri net theory which are relevant to this thesis. 


-70- 


3.3 Event Graphs 


Event graphs are what Commoner and Holt have called Marked Graphs[C1]. 
We have chosen to call them event graphs because we would like to pre- 
serve the distinction between the structure and marking of a Petri net. 
In this Section, we would like to define event graphs and to state 
some results that we will need in Chapter 4, where we look at the idea 
of timing relationships between activities in deterministic systems. 
Definition 3.3.1 An event graph is an SMD Petri net in which every 


place has exactly one input transition and exactly one output transition. 


Example of an event graph 


Figure 3.3.1 An event graph 


Recall the definition of conflict given in Section 2.2, It should be 
noted that no transitions in the net are in confliet. Every marking for 
an event graph is persistent, and every event graph is a deterministic 


Petri net. The structure of event graphs enables us to define them in 
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the following convenient fashion: 
Definition 3.3.2 An event graph is a strongly-connected directed graph 
G = (v,A) 
where V = {v,.--.v} is the set of transitions 
A = fa, ,} S VxXV_ is the set of arcs, where arc ay conne- 


ets transition ‘5 to ma 


A marking of an event graph is a function that assigns to every arc as, 
in the net a non-negative integer called the token load of the arc. Note 
that in this definition of event graphs, an arc corresponds to a place 
together with its incombng and outgoing arc in Definition 3.3.1 (see 


Figure 3.3.2). 


eee |} —~}—__+ 


An arc in Definition 3.3.2 Corresponding structure in 
Definition 3.3.1. 


Figure 3.3.2 


Figure 3.3.3 gives an event graph corresponding to the one in Figure 


3.3.1. 


Event graphs are seen to be conflict-free SMD Petri nets in which 


each simple circuit is a state-machine component. 


Theorem 3.3.1 (due to Commoner and Genrich) 


A marking for an event graph G is live if and only if the token content 


Figure 3.3.3 
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2735 


of every simple circuit is non-zero. 

Proof 

Necessity If the token content of some simple circuit is zero, no tran- 
sition in this circuit can be fired; since the token content does not 
change if other vertices are fired (lemma 3.1.1), no vertex on this cir- 
cuit can be enabled through transition firings. 

Sufficiency Assume that the token content of every simple circuit is 
non-zero. Let vs be any transition in the net. Consider the unmarked 


arcs entering Vye 


I 


Figure 3.3.4 


If there are none, the vertex is enabled. If not, consider the vertices 
from which these arcs emanate. If each of these ig enabled, then clearly 


v, will become enabled after all of them are fired. If some are not, 


i 
consider the unmarked arcs entering them, etc. As we continue this 
back-tracking, we are selecting a subgraph of G which consists of Vio the 


vertices from which these arcs emanate, the unmarked arcs entering them, 


etc. This process must terminate, since G is finite. Now this subgraph 


ake 


must be circuit-free (i.e., a tree, as shown in Figure 3.2.4), since 
there are no token-free simple circuits. Thus, the subgraph must have at 
least one transition that has no incoming arcs which belong to the sub- 
graph. This vertex can be fired at the present marking of G. After firing 
it, the subgraph of token-free back-tracking from vy is reduced by one 


transition. By repeating this process, we can enable Vie 
r] 


There is one more result we will need in connection with event graphs, 
the proof of which is given in [C1]. 

Theorem 3.3.2 A live marking for an event graph is safe if and only if 
every arc is contained in some simple circuit containing exactly one 


token. 


Example Figure 3.2.5 gives an example of an event graph with a live, 
safe marking. Note that every circuit has at least one token on it, and 


that every arc is contained in some one-token circuit. 


Figure 3.3.5 
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It is easy to see that event graphs have a live, safe marking that is 
persistent, and are thus deterministic nets. However, event graphs are 
not the only Petri nets with this property. In the next Section, we 


study a larger class of deterministic Petri nets. 


276s 


3.4 LSP Petri nets (Petri nets with a live, safe and persistent markin 


The class of Petri nets is a large one, but we will confine our atten- 
tion to LSP Petri nets because their behavior can be represented by Petri 
nets whose structure is very similar to that of event graphs. The mark- 
ed Petri net in Figure 3.4.1 is a good example of a Petri net which has 
a live, safe and persistent marking but which is not an event graph. The 
reader can verify that this net is LSP by drawing its reachability dia- 
gram, which we give in Figure 3.4.2. It is seen that in no marking in the 
forward marking class can there be more than one token on any place. Fur- 
thermore, at each marking exactly one transition is seen to be enabled. 
This verifies our claim that the net is LSP. In general, one can determine 
if a marked Petri net is LSP by using the results of Section 2.3. As we 
have pointed out before, the reachability diagram of a marked Petri net 
obscures the concurrency that is inherent in the system it represents, and 
this is precisely what happened when we drew the reachability diagram for 
the net in Figure 3.4.1. We now present an alternative means of represen- 
ting the operation of LSP Petri nets. We will do this by constructing for 
LSP Petri nets a graph known as a behavior graph, and shall explain its 


construction by means of an example. 


Representing the Behavior of LSP Petri nets: Consider the LSP Petri net 
? shown in Figure 3.4.3(a). We begin by drawing and labelling the set 


P < P of marked places in P ,(in this case {p,, Py» Pd ). Let 


Ty = it, st} be the set of enabled transitions corresponding to Pe and 


note that eo corresponds to the initial marking of P . Let Po be the set 


of markéd::places that results when all transitions in Ty are fired. Draw 


775 


Figure 3.4.1 


(0,0,1,1) 


Figure 3.4.2 
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all arcs P X Ty that are contained in PxT. Draw the places = Ty : 
1 


Draw all the arcs qT, x PL which are contained in T™ P. 


Define P= me : 
efine 9 @, T,) x Ty) 


This process of constructing the marked places Pd that results when 


all enabled transitions for ES are fired is called extending the behavior 


k+1 
graph can be extended indefinitely. Figure 3.4.3(b) shows the behavior. - 


graph from Pa to P . Since P has a live marking, its behavior 


graph of P. We now make some definitions. 


Definition 3.4.1 A chain in a behavior graph is any directed path in 
it. 


p B. 
Example Pt P3tsP. ro is a chain in 


Definition 3.4.2 A slice of a behavior graph Bis a set of places 
that forms a cut-set of @. 
Note In a cut-set, no two elements of the set can belong to the same 


chain. 


Example P = {P,> Po» Pe} is a slice of B, 


Each slice of a behavior graph corresponds to a marking of the LSP Petri 


net. 


Lemma 3.4.1 Each place in a behavior graph for an LSP Petri net must 
have exactly one input transition and exactly one output transition. 
Proof Suppose some place p in the behavior graph 8 has more than one input 


transition. Then, there must exist a marking M' in the marking class of 
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LSP Petri net £7 e 


(a) 


Behavior Graph B. 


(b) 


Figure 3.4.3 


Cyclic Frustrum of B. Steady-state equivalent net S. 


(c) (d) 


Figure 3.4.3 
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the net Pin which more than one token is placed on Py» implying that ? 
has a marking that is not safe. Hence, every place in ®Bhas exactly one 
input transition. 

Also, suppose some place p in * has more than one output transition. 
This implies that there exists a marking M’ in the forward marking class 
for ? at which more than one output transition is enabled. This violates 
the assumption that Phas a persistent marking. Hence, each place in P 


has exactly one input and exactly one output transition. 


Lemma 3.4.2 There exists a slice in the behavior graph ® of an LSP 
Petri net ? that occurs repeatedly. 

Proof Each slice of @ corresponds to a marking of P. Since P has a 
safe marking, the number of distinct markings in the marking class for 
the net is finite. Therefore, since the behavior graph is infinite, 


there must exist a slice in ® that occurs repeatedly. 


The reader is now asked to refer to Figure 3.4.3. The slice ei repre- 
sents the initial marking of P. P, does not occur again in B.elA 
behavior graph that has slices which do not repeat is said to have a 
transient, and such slices are termed transient slices. In terms of the 
Petri net P , a transient slice represents a marking that cannot be rea- 
ched after an initial occurrence. The slice P, (shown dotted) is a slice 
which has repeated occurrences in B.a cyclic frustrum of Bis the por- 
tion of B between two consecutive occurrences of some repeated slice. In 
Figure 3.4.3(c), we show a cyclic frustrum of ® bounded by consecutive 
occurrences of the slice ip,» Py? Pt in @ . since Bis derived from 


a persistent marking of ? » only one way is possible of extending this 
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to {Ps Py? Pe} . In general, for any slice PL of the behavior graph, 
there exists a unique extension Fa Thus, every cyclic frustrum bounded 
at each end by the slice ip. Py? Pes is identical to every other cyclic 
frustrum bounded at each end by this slice. Hence, instead of drawing an 
infinite behavior graph B , we will choose some cyclic frustrum of BB and 
then coalesce corresponding slices together in the initial and terminal 
slice of the cyclic frustrum. The net so obtained is termed the steady- 
state equivalent net of Pp, and is shown in Figure 3.4.3(d). The marking 
M for the steady-state equivalent net S& is obtained by putting one 
token on each place in the initial slice. The construction of this net 
is such that the set of firing sequences of & is identical to that of 
the net 7 . Also, the reachability diagram of & is contained in that 
for ? , there possibly being some extra states in P corresponding to the 
transient. The reader can see this from the graph By, and it is not 
necessary to construct the reachability diagram of P, 

We have shown that in B, and hence in 4 » each place has exactly 
one input transition and exactly one output transition. dF thus has the 
structure of an event graph, with the difference that certain place and 
transition names occur more than once in it. Transition t, and places 


p, and p, occur twice, for example. Such an event graph is termed a 


3 
multiply—labelled event graph. We will not define multiply-labelled event 


graphs formally, but will merely say that they are event graphs in which 

certain places and transitions have repeated occurrences (or instances). 
The LSP Petri net P has the property that it is possible to add tokens 

to certain places and still have a marking that is live and bounded, but 


no longer safe. Tokens can be added to all places except Pe and Po» such 
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Figure 3.4.4 
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that the resulting marking is live, bounded and persistent. The steady 
state equivalent net for FP with this new marking is simply the net 3B; 
the marking for & is constructed by adding the same number of tokens to 
a place in J as were added to the corresponding place in P . If there are 
multiple instances of a place in 3, then tokens can be added to any of 
those instances, provided the sum of the tokens added to all instances 
of a place equals the number of tokens added to the corresponding place 
in ?. Figure 3.4.4(a) shows a live, bounded, persistent marking for 
and Figure 3.4.4(b) its corresponding steady-state equivalent net Jb. 

We should mention that there exist Petri nets which have a live, 
bounded marking but no live, safe marking. Figure 3.4.5 shows such a 
Petri net. Thus, the preceding technique cannot be used to construct a 


steady-state equivalent net. 


Figure 3.4.5 
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3.5__ The Structure and Consistency of SMD Petri nets 


3.5.1 Existence of a live marking for an SMD Petri net 


In Section 3.3, we saw that for event graphs a necessary and sufficient 
condition for a marking to be live is that there be at least one token on 
each simple circuit. Since a simple circuit in an event graph corresponds 
to a state machine in an SMD Petri net, the reader may be tempted to ask 
if we can get a live marking for _an SMD net by adding at least one token 
on each state machine component. The nets given in Figure 2.1.9(c) and 
2.1.10(c) are counter-examples to this conjecture. In Figure 2.1.9(c), a 
subset of the transitions in the net can never be fired, whereas in 
Figure 2.1.10(c), no transition can be fired. Both are examples of Petri 
nets which do not have a live marking. We now give a necessary condition 


for a marking M for an SMD Petri net ® to be live. 


Theorem 3.5.1 A marking M for an SMD Petri net is live only if the 
token content of every state machine is non-zero. 

Proof We will show that if the token content of some state machine is 
zero, then no transition in it can be fired. Without loss of generality, 
let 8 be a state machine component of # and let t, be a transition in 
v Ty» A,? then t, € T° By hypothesis, the input 
place Py of transition t, in state machine S 


it, i.e., if Pe = <P 
i 1 must be unmarked. Also, by 
Lemna 3.1.1, it must stay unmarked. This implies that there is no marking 
M' ¢ M at which transition t can be fired. Hence, the marking M is not 
live. Note that no transition in 3 can be fired at any marking M" c M. 

es 
Hack[H5] has shown that a subclass of SMD Petri nets called SMA ( state 


machine allocatable) Petri nets are similar to marked graphs in that any 
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(a) 


Reduced net No. 1 


Reduced net No. 2 


(b) 


Figure 3.5.1 
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marking that puts at least one token on every state machine is live. We 
proceed to describe the structure of these nets in what follows. The ma- 


terial in Subsection 3.5.2 is taken from Hack[Hl, H5]. 


3.5.2 State Machine Allocatable Petri nets 


Definition 3.5.1 A state machine allocation over a Petri net <P,T,A) 


is a function B: Tt-~}pP such that 


¥tewtT B(t) € “te 


Informally, this means that for each transition in the net, we pick 
one of its input places and ignore the others. Given such a state machine 
allocation, we perform a reduction by deleting certain places and transi- 
tions in the following manner: 
Step 1 Delete all places for which at least one output arc has been 
deleted. 
Step 2 Delete all transitions that have all output places already deleted. 
Repeat Steps 1 and 2 until neither is applicable anymore. What is left 
over is the reduced net. Each step eliminates some nodes and arcs that are 


not part of the reduced net, until no more nodes and arcs can be deleted. 


A Petri net is said to be state-machine allocatable iff every state 
machine allocation gives a reduced net that is either a strongly-connected 
state machine or a set of strongly-connected state machines. We will 
abbreviate the last remark by using the contraction "SSM" to denote "a 
strongly-connected state machine or a set of strongly-connected state 
machines", In Figure 3.5.1(a) we give an example of a Petri net which is 


state-machine allocatable. In Figure 3.1.5(b), we show two allocation 
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Figure 3.5.2(a) 
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SMA Reduced net No. 1 


SMA Reduced net No. 2 


Figure 3.5.2(b) 
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An SMD Petri net that is not SMA 


Figure 3.5.3(a) 
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Figure 3.5.3(c) Non-SMA Petri net with a live 


marking 
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reduced SSM's. Reduced net No. 1 results from allocating 7 to Py and 
reduced net No. 2 results from allocating t, to p,. Since each of the two 
allocation reduced nets is an SSM, the net is SMA. 

Let us give another example. Consider the Petri net in Figure 3.5.2. 
Since each of the state machine allocation reductions is an SSM, the net 
is SMA. 

We now give an example of a Petri net which is SMD but not SMA. Such a 
net is given in Figure 3.5.3. We see that two of the SMA reductions are 
SSM's and the other two are null; the net is not SMA. In case the reader 
is a little confused, we would like to point out that the property of a 
Petri net being SMA is a structural property of the net, and he will note 
that no mention has been made of markings so far. 

The reason we have introduced SMA Petri nets is that they form the 
largest subclass of SMD Petri nets we know of that has the property that 
any marking which puts at least one token on each state-machine component 
is live. This is the result presented in the next Theorem. We present 
this Theorem without proof and readers interested in the details are refe- 


rred to Hack{H1,H5]. 


Theorem 3.5.2 ( The SMA Petri net liveness Theorem) A marking M for 
an SMA Petri net # is live iff it puts at least one token on each state- 


machine component of the net. 


This Theorem is of interest because it tells us how to construct a live 
marking for any SMA Petri net- simply put one or more tokens on each state 
machine component of the net. The non-SMA Petri net shown in Figure 
3.5.3(a) also has the property that any marking that puts at least one 


token on every state machine is live, as the reader can easily verify. 
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Figure 3.5.3(c) shows the Petri net of Figure 3.5.3(c) with a live mar- 
king. We present the following as an open problem: © 

What is the largest class of SMD Petri nets which has the property 
that any marking that puts at least one token on every state machine 
component is live? 


We now turn to examining the issue of consistency for SMD Petri nets. 


3.5.3 Consistency of SMD Petri nets which have a live marking 


Recall Corollary 3.2.1, which we repeat here for convenience: 
Corollary 3.2.1 If a Petri net P is LB, then there exists an initial 


marking M for which there exists a cyclic firing sequence. 


In Section 3.4, we established the connection between a cyclic firing 
sequence and the steady-state equivalent net for event graphs and LSP 
Petri nets. We now introduce a concept similar to the steady-state equi- 
valent net in connection with live SMD Petri nets. A firing sequence, as 
we have pointed out in Section 3.4, expresses an ordering relation on tra- 
nsition firings in a fashion which obscures the concurrency that is in- 
herent in the Petri net. To preserve this inherent concurrency, we intro- 
duced the behavior graph for LSP Petri nets. We will now introduce a more 
general concept to study the behavior of SMD Petri nets- the occurrence 
graph. This notion is not a new one, having been studied extensively by 
Holt{H6]. As we did for behavior graphs, we shall illustrate the cons- 
truction of an occurrence for an LS SMD Petri net by means of an example. 


We begin with a live, safe marking for an SMD Petri net(see Figure 3.5.4). 


Construction of Occurrence Graph Begin by drawing every marked place 
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Figure 3.5.4 
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Occurrence Graph No. 1 


Occurrence Graph No. 2 


Figure 3.5.5 
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in the net together with a token on each place (i.e., places Py and Ps)- 
Now, since a place may have more than one output transition, we may have 
several enabled transitions connected to the same place. In event graphs 


and LSP Petri nets, we saw that every place “has at_most one transition 


enabled at any marking. This is not | necessarily hrve for SMD Petri nets. 
As an example, consider the. marked Petri net of Figure 3.5.4. Both the 
output transitions t. and ts. of place Ps are. eon, to ine enabled; since we 
can fire only one of these transitions, we must: waka: a choice between ‘them 
and fire the one we choose. We will extend 2 _Blace in an occurrence gtaph 
by drawing the output transition chosen for. firtng, firing that transition 
and then drawing the new marked place(s) that result. A slice of an occu- 
rrence graph is a set of places that forms a cut-set of the graph. We can 
talk about extending a slice in the same way as we did for behavior graphs, 
the difference being that a choice ‘day ‘have to bé wade Between enabled tra- 
nsitions. In the construction of a behavior graph) there fever occurs a slice 
for which a choice has to be made between output transitions. Thus, there is 
only one behavior graph for an event graph or an LSP Petri net, and this 
ah is unique. On the other hand, several occurrence graphs may be possi- 
ble for LS SMD Petri nets. In Figure 3.5.5, we show two possible occurr- 
ence graphs for the net in Figure 3.5.4. The dais will realize that an 
infinite number of occurrence graphs is possible for this net, or, for that 
matter, in any LB net with a non-persistent marking. 

Now let us apply Corollary 3.2.1 to the occurrence graph for an LS 
SMD Petri net. An occurrence graph is a concurrent representation of a 


firing sequence for a Petri net, and each slice represents a marking 


of the net. A repeated slice thus represents the repeated occurrence of 
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Consistent current assignment 


Figure 3.5.6 
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Figure 3.5.7 Cyclic Frustrum 
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Figure 3.5.8 c-equivalent net 
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some marking. The portion of an occurrence graph between two consecutive 


occurrences af a slice is termed a cyclic frustrum, and corresponds to a 


cyclic firing sequence for the marked net. Theorem 3.2.1 tells us that 
for any consistent current assignment for an LB Petri net, we can find 
a cyclic frustrum in the occurrence graph of the net. The number of 

occurrences of any transition in the cyclic frustizim equals its current 


‘. 


in a consistent current assignment. , 


Example 3.5.1 Consider the SMD Petri net of Figure 4.5. 4, In Figure 


ee 


3.5.6, we show this Petri net with a minina, Aapeage consistant current 
assignment. We draw the occurrence graph os Aistyosed earlier, and Figure 
3.5.7 shows a cyclic frustrum of ena ooparrence § sraph,/ in which the multi- 
pretty of each transition equals ‘igs dave tates current in the consistent 
current assignment exhibited ‘in Figure 3.5.6. We now coalesce correspon- 
ding places in the repeated slice ae manner similar to what we did for 
behavior graphs. The resulting ‘atrongly-connected net is termed a consis- 
tency -equivalent net for the SMD Petri net, abbreviated to "c-equiva- 
lent net" . The c-equivalent net for eka’ cyélae frusttlm Of Figure 3.5.7 
is shown in Figure 3.5.8. : . 

Let us now note some facts about the reletionghip between an SMD Petri 
net and its c-equivalent Petri net. Let us begin by saying ‘that we will 
consider only minimal integer consistent current assignments. Note that 
the c-equivalent net of:an-SMD Petri-net P49 not unique. Figure 3.5.9(a) 
shows an SMD Petri net P with a minimal integer consistent current assi- 


gnment. The net QR has several c-equivalent nets, two of which are 


shown. We observe that in the construction of a c-equivalent net for an 
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Figure 3.5.9(a) SMD Petri net P. 
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Figure 3.5.9 (b) 
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SMD Petri net # with a minimal integer consistent current assignment, 
each state-machine in P corresponds to a circuit in ‘the édquivwlent net. 


We formalize this in the following Lemma. 


Lemma 3.5.1 Let # be an’LS SMD Petri net with a minimal integer consis- 
tent current assignment :, and let 6 be a c~equivalent net for #. Then, — 
every state machine component in 7 corresponds to a simple eiseuat iné. 
Proof Let F be the cyclic frustrum corresponding to J. Applying 
Theorem 3.5.1, oe see that the initial slice of # must contain at least 
one place from each state-machine in - : Now consider*a chain in # boun- 
ded at its extremeties by two consecutive instances of some place Py: In 
J » this chain corresponds to a simple circuit. By the construction ~ 
procedure for Occurrence graphs, it is obvious that each state machine in 


corresponds to a chain in ¥ . This proves the Lemma. 


Example Consider the LS SMD Petri net of Figure 3.5:6. Each of the two 
state machines corresponds to a circuit in the c-equivalent net shown in ~ 
Figure 3.5.8. Note that there are two circuits in the c-equivalent net 


which do not correspond to any state-machine in the SMD Petri net. An - 


example of such a circuit is Pot yPstsP4t,P3°3PotP4° Po . 


: ee 
Since the c-equivalent net corresponds to a cyclic firing-sequence for an 
LB SMD Petri net, we see by applying Theorem 3.2.1 that the multiplicity 
of a transition in the c-equivalent net must equal itg current in a mini- 


mal integer consistent current assignment. 


To recapitulate the main results of this Section, we have the following: 


(1) We have introduced the subclass of SMD Petri nets known as SMA Petri 
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nets, which have the property that a marking is live if and only if each 
state machine contains at least one token. Thie*is the largest class of 
SMD Petri nets known to date that has this property - 

(2) We have introduced the c-equivalent net of a . Petri net which has a 
live, bounded marking. | For SMD Petri notes we have seen that every ‘state- 


maths corresponds to a circuit ina crequivalent. net. ‘The converse is 


not ‘true in general. 


We now turn to the issue of “upsiving Petri net ‘theory to the analysis 


of asynchronous concurrent eystens. 


CHAPTER 4 


TIMED PETRI NETS 


4.1 Timing. in Petri Nets 


In our discussions so far, ss have not entered into any timing con- 
siderations in connection with Petri nets. Thus, while Er Petri nets 
can model the structure of asynchronous concurrent syetens, they do not 
contain enough information to be used for a study of Jaance ‘of perfor- 
mance of the type discussed in Chapter 1. In defining Petri Bete we 
made no assumptions about the length of time it takes for a transition 
to fire. In real-world systems, activities do not take place instan- 
taneously. Every activity in a system has a time duration which is 
different from zero, and in all the systems we will model, we will 
make the added assumption that all activities complete in a finite 
amount of time. In the Petri nets that we use to model these systems, 
we will assume that every transition takes a bounded, non-zero amount 


of time to fire. The resulting model of asynchronous, concurrent systems 


is termed Timed Petri nets, and is formalized below. 


Definition 4.1.1 A Timed Petri Net is a pair ‘2%,0) where ? is a 
Petri net (P,T,A) and © is a function that assigns a real, non- 


negative number 7; to each transition ty in the net. 


2; T——_—» BR {RM is the set of non-negative real 
numbers }. 


This non-negative real number T : * Ne Py is termed the firing time 


of transition t.- 
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The operation of the net can now be assumed to take place in real time. 
At any instant T of real time, the net has a thatking M(T), with the 5 
underst anding that we may view M both as a wectew: and a function. We 
denote the initial marking as M(0). We write MCT p) Yor, M(r)(p): i.e., 
the numbey ‘of tokens om. place P at tine tT. A transition t is said to 


i 
be enabled at time rT if and only if every input plac¢@ of transition t 


i 
has at least one token on it, i.e., ° 
M(T,p,) = 1 for all p, € ‘t,. 
When transition t, is enabled, a firing can be initiated. When a firing 


is initiated, a token is removed from each input place of t i and 
transition t, ig said to be executing. This execution phase lasts 


for T, seconds, where T, is the firing timé of transition t,. At the 


i 
end of this time duration, the firing of transition t, terminates, and 
a token is placed on each output place p j € ty*- This completes the | 
firing of transition t,- By fe 88 oe BA | 

The three phases of a transition firing can be visualized by. imagining 
every transition as consisting of two. transitions end an intermediate 


place as shown in Figures 4.1.1. The firing time of the transition t, 


can now be associated with the place *, in the following fashion: 
When transition t, initiates, t, ' fires inatant aneously, a token’ 
fi@ removed from each input Place. of t ‘y ’ and | a. taken ; is deposited on 
place m, . This token is held by Place. x, , for a duration equal to Ty 
the firing time of t,- At the end of thie . interval, transition tif 
fires, corresponding to the termination of | a on 
The initiations and terminations of transitions in a Petri net must 
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(a) (b) 


Figure 4.1.1 


satisfy the following balance equation: 
Notation 
Let p be a place in a Petri net P. Let trpeeetan ‘be the input 


transitions and Cie lm the output transitiohs of p. 


Let I(T,t) denote the number of initiations of transition t up to 
and including time T, ee 
and T(r,t) denote the number of terminations of transition t up to 
and including time T. 

Then, if M(0,p) is the number of tokens on place p at time zero, and 
M(r,p) that at time T, the following must be satisfied: | _ 
in) 7 M@P) + 1(r,€ 41) #04 7,5) 
We will refer to this as the token halanca saqustion, amd 'will ‘make use 


of it in later sections. A word now about the choice of a firing time 


M(O,p) + TT tte. T(r,t 


for a transition in a Petri net. 
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We have defined T, to be a ‘non-negative xeal: ‘number, thereby agsu-_ 
ming that the duration of each‘ activity is fixed. This may not .be a very 
accurate picture of real-world systems, for , im practice, the execution 

- time of an operator depends upon the data it:-is cakled upen to handle. 
In a floating -point adder, for example; the add time will depend upon 
the arguments and their -exponénts. Thus, 1¢ may Be more reasonable to 
assume that a4 transitfon‘ffring time is a ratidem wariable whose distri- 
bution can be represented by a rectangular distributiqn of the form 


shown iri Pigure 4.1.2. 


‘-.. . Apsamed distribution. 


Se : 
PE : iS be 


More realist ic 
| assumption 


Figure 4.1.2 “+ |. 


We will return to a consideration of statistical firing times in Section 


4.5. 
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In Chaptere.2 and 3, wa introduced the ideas of firing sequence and 
behavior graph to characterize the action of Petri nets. For timed Petri 
nets, we have postuleted the existence ofa real-time axis. against which 
the firing of transitions can be noted. At time zeno, the net has a. 
marking M(0). Transitions are allowed to fixe, .qnd the time at which 
the firing of a transition takes plane is recorded ina table known. as 
a firing schedule. 

A firing schedule for a timed Petri net is a get of sequences of ini- 
tiation and termination times for the transitions of a net. The firing 
of a transition is feasible if the transition was enabléd ‘at the inst ant 
the firing was initiated. If every firing in a firing schedule fs fea- 


sible, the firing schedule is feasible. A firing schedule is infeasible 


(or not feasible) if it calls for the initiation or termination of an 
activity earlier than.allowed by the termination of other activities. 
Figure 4.2.1 is an example of a timed Petri net and Figure 4.2.2 gives 


a feasible firing schedule for it. 


2 Ld 
To" 


Figure 4.2.1 <A Timed Petri net 
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Number of 


(29, 31) 


Figure 4.2.3 Infeasible Firtng Schedule for Tiited Petri Net in Figure 4.2.1. 
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Figure 4.2.3 gives an infeasible firing schedule for the same net. The 
starred entry implies that the:third initiation of transition ts takes 
place at T = 30; this is impossible, because the third initiation of 


2 
which happens at T = 31 units. 


t, cannot take plaee before the termination of the fifth firing of t,,. 


Readers will note that in the feasible firing schedule of Figure 
4.2.2, the initiations of transitions t,, t, and t, 
lar intervals. For, example, consecutive initiations of transitions ty 


take place at regu- 


a fPo 


and t, occur at intervals of 12 time units. Transition t behaves in 
a slightly different fashion. We notice char the : RSTEC, third, fifth,... 
transition. initiations occur every 12 ‘time ‘unite, i. 3 ‘alternate 
initiations occur every 12 time units. .... «4 

A firing schedule with this property is termed a periodic firing 
schedule. If all transitions in the net can have consecutive initia- 


tions at regular intervals, we would term this a; strongly -perfodic | 


firing schedule. rnee comput ation rate of a erangition ie the iavet age ie 


number of firings of that transition in unit time. We can see ‘that 


1 and t, have a computation rate of, once every R time al 


units, or 1/12. Trapsition t 


transitions t 


3 has a ee rate of twice every 


12 time units, or 1/6. These comput ation. rates are “che maximum rates 
possible for the transitions. 

The Petri net in Figure 4.2.1 is seen to be an LSP net, so that it 
represents a deterministic system, and its steady-state equivalent net 
ig the multiply~-labelled. event-graph. shown ig Figure 4.2.4. The | 
3 is 2, while that of both ti and te is 1. 


A consistent current assignment for the net aT = 9, = 1, 93 = 2. 


multiplicity of transition t 
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Figure 4.2.4 Btesiy-itate eclivaient net for the Petri 
net in Figure 4.2.1 


We note that the computation rates of transitions: tp t, and t, are in 


“eRe ratio “of their currents ia. a consistent current assignment . Before 
ea 

we can | Justify ane we “look informally é the ‘comput ation rate of 

transit ions in an ‘event graph; mUccess tye generalizations of the ideas 


presented below will iced us to all results of ‘interest in this thesis, 


a 


including the ee rate of transitions ‘tn timed . Petri nets. 


— .« Consider the simple circuit shown in Figure 4. 2. Se in which each” 
transition t, bas an associated firing time Ty, . We will begin by agsu- 
—.-.. Ming that the circuit has one token on it, and pti ee extend the ana- 


. ysis to.n tokens. With one token on the circuit, each transition can fire 
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in turn over and over again. Suppose that the circuit is denoted by 
PytyPoty-+-Pyey » and suppose (without loss of generality) that initially 
the token is on place p,. Let t= 7, +7, teach T) Then, the token 
fires every transition in the circuit in_tyrn and reappears on p, every 
m seconds, assuming that no time is allowed to elapse between a tran- 
sition being enabled and being initiated. Under this assumption, every 
transition initiates at intervals of x seconds, and x is the period of 
the firing schedule for the circuit that realizes ‘the maxipum comput at ion 
rate. The computation tate is easily seen to be:p = 1/x. Now suppose that 
the circuit has n tokens on it instead of 1. For the combineid action of 
the n tokens, the firing rate becomes” afx; or nu < Every transition 
Et, 
in the circuit has a << ere rate given by ents expression. 
Let us now consider ticaaittons: to a timed event a We know 
from Theorem 3.2.1 that in an event staph with a live marking, the token 
content of every simple circuit is non-zero. Thus, let the token content 


of a circuit C. be Me where m ¥ 0. Also, let x denote the sum of the 


firing times of transitions in CG. il every circuit were by tteelE, the 


oy Ae ou 


transitions in circuit C, would have a 1 computation rate of | m / m 
However, in a strongly-connected event graph, the eircutts are inter- 
connected and intuitively it is cigar that ‘they will affect ach: others 
natural computation rate ({.e., the computation raté ‘the trahsittons in 
a circuit would have if the circuit were isélated°from ‘the other cir- 
cuits). 
Now, without loss of generality, let C, be the circuit with the sma- 


llest natural computation rate n/*,- Clearly, all transitions tyqreeetyy 
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Figure 4.2.5, 
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€ Cy have a computation rate that cannot exceed n,/*, (see Figure 


4.2.6). 


Figure 4.2.6 


Extending this argument, we see that if there exists a directed path 
between any two fale teteee in the event easly; they must have the same 
computation rate. Since every transition is on some directed path from 
transitions tip tio: eeetyy> we conclude that all transitions in che 
event graph must have the same computation rate, which cannot exceed 
n,/x,. What this means is that in any timed event graph (6,0), all 


transitions have the same computation rate, which cannot exceed 
min { Ny /Hy 006 n/a, 
where Cys0+-C, are the simple circuits of the graph, 


n, is the token content of circuit Cy» and 


i 


x, is the sum of the firing times of transitions in circuit C,- 


i 
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We have been able to obtain,informally, a bound on the computation rate 
of transitions in a timed event graph. In the next Section, we will 


show that this bound can actually be realized. 
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4 d nt ‘aphs 


In the previous section, we saw informally how we could obtain a 
bound on the steady state computation rate ofa timed event graph. 
Here, we will show that the bound we have obtained is actually 
realizable; furthermore, we will show that there exists a strongly 
periodic firing schedule for which the transitions have a computation 
rate equal to the bound. 

We will use the notation for event graphs given in Definition 3.3.2. 
To recapitulate, let 4 be a timed event graph (G,Q) where 

G is a strongly-connected event graph (V,A). 
Recall that V = {v1 ,Vo0---¥) is the set of transitions. 
A= fa,,} © Vx V is the set of arcs. 
45; is an arc that connects transition v, to transition Vy: 


Q is a function that assigns to each transition v, a firing time Ty 


i 
i.e., Q: V —» RR (where R is the set of non-negative real numbers). 
We. write T; for Nv,). 

Let M(O) be the initial marking of the net. Recall that M is used 
interchangeably as a vector and a function. For event graphs, 


M(T,a; 4) will denote the number of tokens on arc a,, at time 7. 


1j 
Arcs in event graphs will be treated in the same way as places in un- 
restricted Petri nets. 

We now proceed to formalize the definition of a feasible firing 


schedule for timed event graphs. 
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Definition: A firing schedule S for a timed event graph 4 is a pair 


2 


(Sy), s,) such that. 
Ss, 7 xV—>® 
Spi 1x VR, 1 = (1,2...) 
and S,(n,v,) = S;(n,v,) + T;- 
S,(a,v,) is the nth initiation time of transition vy- 
S,.(n,v, ) is the nth termination time of transition Vy. 


A firing schedule for a timed Petri net: may not be realizable because 
some transitions are specified to fire when they are not enabled. A 


feasible firing schedule is one that does not violate this requirement. 


We now show how to determine the marking M(7) at any instant of time Tr 


for a timed event graph & which has. feasible firing schedule S. 


4 


Notation: At any instant of time T in a firing schedule 3 for timed 


event graph , 


I3,v,) denotes the number of initiations of transition Vv, uP to and 
including time 7. 


T, (T,v,) denotes the number of terminations of transition v, up to and 


i 
including time 7. 


Let ayy = yy 


) be an arc in the event graph G. 
By token balance . | | 


fee 


M(T,a8, 5) = MQ,a, 4) + T,(r.v,) - T,v,)- 


Now congider the enabled transition v j shown in Figure 4.3.1. If an 


initiation of ‘5 takes place at the instant tT, then immediately prior. 
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arc of vy: We will use the notation 7” to denote an instant (r-€) where 
€ =» 0. +r” will, for all practical purposes, denote the instant T 


itself. 


Definition: A firing schedule S for a timed event graph His feasible 


iff for any transition v and for k = by ee ee we have | 


j 


Va, € "vy, MOS; [k,v,]", a,,) = 1 


where Sy{k.v,] denotes the instant just prior to S,[k,v,]- 


+ 


Input arcs of 
transition j 


Figure 4.3.1 An instant when trans ition Y, can initiate. 


Suppose we now consider a run of the event graph 4. Consider an 


arc a,, that initially had M(0,a, ,) tokens ‘dn it: Sdppose transftion 


j 
has fired n times up to and including an instant-oftime*r.. It is =~” 
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obvious that in the same interval of time, transition v 3 could have 


fired at most M(0,a ip + n- 1 times without Rete S the token content 


of arc aij negative. The (00,8, > + n)th initiation ‘of transition V5 


must await the nth termination = transition v,.: Wé’will show in the 


in 
next Theorem that this is a necessary and sufficient condition for a 


firing schedule to be. feasible. 


Theorem 4,3,1: A firing schedule S for a timed event graph 4° = (6,0) | 


is feasible iff for each arc ay in the graph G, 2 for n= 1, 2 3ees 


S,(n,Vv,) < S,(n + M(0,a,,), Vv): 


ij 
This can be stated eter eney as 


s 1,¥,) +7, < Sy (2 + MO,a,,), ¥,)- 


Proof: 
Necessity: We prove ‘this ‘by contradiction. Suppose there exists 
a feasible firing schedule such that for some ‘n ‘and ‘gome aij 


S,(n,v,) > Sy (n + M(0,a,,)> Vy) i.e., 


S,(n,v,) + T; > S, (a + M(O,a, 4)» v4) 


Let Tr = Si (a + M(0,a, 4)» v,) and therefore 


T5@sv,) =n+ M(O,a, » 


j 
Since 7 < S,(a,v,), we ‘have Ty (r3v,) <n. 


tt Bes 


Using token balance, 
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M(T,a,4) = M(0,a,4).+ Tg(sv,) - I(t, vs). 


) 


= M(0,a,,) + Te(rsv,) - n - MO,a, , 


= Tg (r,v,) -n 


i.€.4 M(T.a, 5) <0 


This is the desired contradiction of feasibility of S. 


Sufficiency: Suppose S is a feasible schedule, and consider any instant 


of time rf > 0 such that 


We wish to show that for the instant T just prior to an initiation of v5 


Mr ,a,,) 21. 
Now, at time Tr, 
M(r" a, 4) = M(0,a, 5) + To(r-,v,) - @+ 4(0,8, 4) - 1) 
=I5(r.v,) -nt+1 
but 
Tyr" sv,) 2n. 
Hence, 


M(T™,a, 5) =1 . A 


We have introduced the idea of a strongly periodic firing schedule in 
Section 4.2. We now give a formal definition of a strongly periodic 


firing schedule for a timed event graph. 
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Definition: A feasible firing schedule S’ for a timed event graph: § 
is strongly periodic with period %™> 0 iff there exist real jhumbers 
x, 2 0 such that 
S,(n,v,) =X, + (n-1)%. n= 1,2,... 
i= 1,2,4..m. 
The real numbers KpoeeeX, are the displacementg of the firing times of 


transitions from the instants 0,%, 2x,... 


Corollary 4.3.1.1: Let a4; be an arc in a timed ‘event graph 4. ‘Then 4 
has a strongly periodic firing schedule with period «> 0 and with 
displacements Kypoee eK, iff Vas; € °G, ‘we ‘have 


). 


x, +T, < %, + M(O,a,, 


Proof: Theorem 4.3.1 states that Y a €° G-aad for‘n-=1,2,... 


S;(@,v,) +7 s S;(n + M(0,a )»» vj)--- [4-1]. 


i ij 


By the definition of a strongly periodic firing schedule we have 


S,(a,v,) =x, + (n= 1)n 


ij 


and S,(n + M(O,a, 4)s v,) = x, + (n + M(0,a ) - 4), 


Substituting into inequality 4.1, we have 


x, + (m-1) +7, <5", + (n + M(,a,,) «ay 


j 


oe a T 4. se 4 ee M(O,a 


14) ee 3 af 


The inequalities of Corollary 4.3.1.1 canbe'tewritten’ *~- 


X57 % 2 T; = M(O,a, ,) ¥ a,;€ G. 
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The quantity rT, - 7M (0,8, 5) is a constant for each arc a,)- 
Let us write 


TT 7 m™(0O,a,.) =c 


ij ij” 
Corollary 4.3.1.1 can be rewritten as follows: /.. . 
A timed event graph 4 has. a strongly periodic firing schedule for 


displacements X,> 100K, iff 


-x, 2 ¢,,...[4.2] 


Vv 55 € G, we have , 4 44 


where c - 7m™M(0,a 


he ee | ij)” | 

We would now like to investigate under what condi tions the set of 
inequalities [4,2] is true. In order to do this, we make a brief 
excursion into the theory of potentials for directed graphs. The 
development given here follows very closely the material in Berge and 


Ghouila-Houri, pages 144-145 and pages 155-157 [B3]. . 


4.3.2 Existence of Potential Differences for event graphs 


tn es 


Suppose with each arc a,, of an event graph G we associate a real 


ij 
number 84; which satisfies the condition that for every cycle € € G, 


2% - 245 a 


a,,€ 8° a,, & 8 


where &* denote the arcs of the cycle oriented ina given sense, and §~ 


the arcs oriented in the opposite sense (see Figure 4.3.2). Then, we 
will say each 84 represents a potentigl difference across its corresponding 


are ayy: Readers familiar with circuig¢ theory will notice the similarity 
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¢ 


- pt 
19> #937 83, © § 


ay & = 


Figure 4.3.2 A Cycle 


of this notion of potential difference to that used for electrical 
networks. In networks we can choose any arbitrary node to be a 


reference node and assign it a potential of zero.. The potential of any 


node in the network is then equal to the potential difference between 
it and the refet@nce node. We-will now show that if:the ates ofan. 
event graph can be given a potential difference assignment, then it is 
equivalent to saying that-each transition. v, “can be-assigned a. potential 
Xs such that the potential difference 8, j eR tse have. used: x, wot 


to denote the potential associated with transition v,, and -have. used x 


earlier to’ denote the displacement associated withev, 3° - This has. been 


i 


done intentionally. 


Theorem 4.3.2: A function @: A -—®R. which asgigns a real number O53. 


to each arc a, j is a potential difference assignment iff there exists 


a function X: V —*® which associates with each vertex Vy in the event 
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graph a real number x, such that for every arc a,, we have 


i ij 


855 = | - X;- 


Sufficiency: If 6,, is defined as given, then consider™a cycle 


ij 
& = (4 2V9> ae “Vy9V})- 


od 


Define § = +l if a,, is directed in one sense 


as; ij 
g = -l‘if a,, is directed in the opposite sense. 


Then, ba12 812 =X, 7X 


Summing, we get 


> ®; - ) My = 0. 


Thus, ‘the assignment @ isa potential difference aasigneent. 


Necessity:. If. @ is:a potential difference assignment, let us define .. 
the potentials. x, step hy step. eaogthbh folds eau. 
Take an arbitrary vertex:¥, and: assign. the coeffielest x, = 0 to it. 
If vy has been labelled, end.v, has. not yet, been; dabalied,.and if 945 
is an arc in A, then we write 
x a x, + e, j" 
Similarly, if e,, te amare in A, then we write 


x, =x, +0 


1 j ij 
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or x, =X 


j 
Since the event graphs we are interested in are strongly connected, 
the potential of every vertex can be assigned thts way. | 

The potential assigned to a vertex vy ie. usitquely defined; for 


otherwise there would exist two chains Gy and.¢, going from v, to 


1 
Vo such that they form a cycle around which the potential differences 
do not sum to zero (see Figure 4.3.3). This would violate the 


definition of potential difference. 


Ce 0 Qe + Qn 


Figure 4.3.3 


We now examine the conditions under which a potential difference 
assignment exists for directed graphs.‘ The following Theorem holds 


for all connected graphs (i.e., not just strongly connected graphs). 


Theorem 4,3.3: Let G = (V,A) be a connected graph, and let us associate 


with each arc a, € A an interval [k where k, < 1, A 


ij’ 1y4]s 


j j i 
necessary and sufficient condition for the existence of a potential 


difference assignment. 
- such that . 


Ky < 8; < 1, 
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is that for every simple cycle &, 


yy j . Yes i 


ayy € &- a5 € et : 
D4; = Dee j 
ar5 € et a,, € e- 


Proof: (Suggested by Jack Dennis; we disagree with the proof in B3). 
Necessity: The condition is necessary; for, suppose such a potential 


difference assignment @ exists. Then, we have, for a cycle &, 


0843 . ds =s 


re € §* ayj € = 


Lf we use the lower bound of ky for each arc in the first term and the 


upper bound li, for each arc in the second, we have 


* 


243 = Yks5 


" + 
as, e§ a,4€ 4 


By doing the converse, we get the second inequality.* 


Sufficiency: Here, we shall use the concept that each interval.represents 
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a set of points on the real line. We will write 
Ya37 [ky 4, 15] = (x|x 2 kjj, xs 1,4). : 

We will use the concept of the interval sum of two intervals A and B, 

defined as 
A+B = {a+ b| a € A and b € B}. | 

Also, the interval difference of two intérvals A and B is defined as 
Wee ere err a ere | | 


‘The conditions of the- Theorem can be written as - 


yy, ‘2 Dey : 5 - aa ; 


a, €5 gee ” 
and ac. “s fee es 
Tn Se ee Me] 
ayy € e- ee, € gt it a aes er eam eee emma 
Now consider the’intervel "  ° - at “ 
#1, €58 aa de TS oe ca te ac 


From [4.3] and [4.4], 
OE]... [4.5]. 


But J can be rewritten as °” 


J | Yess , Y45 - [ Ys a 4, ] 
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oot. = Digs . «enti 
a4 ¢ a #,, € g* 


From [4.5] and [4.6], 


nee = phe wee 14.7] 
5 a,,€5° 

Thus, the conditions of the Theorem can be written.ag in [4.7]. We shall 
show by induction that [4.7] implies the existence of a potential difference 
assignment @ such that k, < a, <1, sos This is ‘true for a graph with one 
arc; we shall suppose that it is true for wed graph with (m - 1) 
arcs and show that it holds for a graph G with m arcs (labelled 1,...m) 
with the intervals Yyoeee¥, subject to conditian [4.7]. Let t be a 
graph with .(m - 1) arcs that is obtained by deleting arc h = (V45¥,) 
from G, and let 5 be the potential difference aspigement, for. G, Let. 
&, be the potential difference across the vertices VyoVy in the graph G 


h 
under the perentse? difference destnmank 6. If bh € Y,, then by adding 


arc ah) to G aad assigning it a potential difference a, = bw we have 


= ee er Se 


constructed ‘for ‘the gtaph Ga potential difference asrigmment 
@ = (,,- o0sO ys 6). 


If S, does not lie within the limits fk tJ, then there are two 
; Peg TT! Tweet aa 


h 
possibilities: 


Either (a) bi < k, 
or (b) &y > Le 


We will assume without loss of generality that for all simple cycles 
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Figure 4.3.4. 


& € G which contain the arc h, he &. In G these cycles correspond to 


simple paths from vy to Vie Thus, for each simple path S from v, to v 


i j 
have 


; € -~ ¢. h 
94 g aq &5 
a #¢h = 
or bE ~ Y pa - Y¥oa 
- + 
aq & 5 aq © § 
a #£¢h 


We will refer to the set of simple paths from v5 ‘to ‘vee the hammock «3. 


# = {sle is @ simple path from v, f 
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Case (a) & < ke We will show that it must be possible to reassign 


potential differences to arcs ig the hammock # 80 that 5 2 k We do 


this by finding a cut set C_ ¢f the hamiock M(see Figure 4.3.5). 


i 


ff \ \ 
ye 2a Aue) 
Ornella ® \ 
l rf Aa Me 8 ‘ 
‘ 
bos - . ‘ } 
bys ‘ : : 
\ a a 
t / yi 
\;/ rid 
\y i 


oe 


Figure 4.3.5. Hammock BH, 


With each arc m = ad in the cut set G» we associate a slack On defined by 


-_ = + 
On kK, o, if me e&. 
On Om ~ an ifmeéeé. 


Let 3° = min [a,>---,3,] 


1,...¥r ec, 


We increase the potential difference e of each arc m € Ee by 3” and 
decrease the potential difference @ ‘of ach arc m€ 87 by aoe ees 
Thus bh has been increased by ‘ae. ned 6p-+ 3" aks 8 are done. 


Otherwise, repeat the above procedure for some other cut set, and so on. 
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If no more cut sets ar&é found for which ‘thts potential difference reas- 
signment can be done, then there must be some pe@th'® in which the © 
“following is true: 
+ 
Ea ko Vmeé 


Sa tm Eero 


ee 


For as long as there as no “such path, there met exiet a cut set whose 


arcs do not have potentiet differences ‘eeiak to Gites limit. 


Now consider the cycle consisting of the path s and are k. 


= ye = Yn <k, 
meee mes 
ae 


tA ig ste 


seviaiing that xk was defined to be in &, we see ‘that 


re Sa ee ee a 
Be - th <e 


me€ et me e- FS geeky 


which contradicts the first condition of the Theorem, namely that 


Yas - Yes 


13 € e~ ayy é€ et. 


Similarly, we show that for Case (b), ‘ e., 8, - 5 at mae be possi bis 
to reassign the ‘potenttale in the eae Hae: age és ios If nes 
we can show by a method similar to that for case (a) that there must 


exist a path for which — one ae the ow 
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which contradicts the second condition of the Theorgm.. ... 


This proves the Theorem. 


Corollary 4.3.3.1 


A potential difference assignment @ such that ®, j 2 k, j (for all 


ai, € G) exists if and only if, for every circuit &, 


dey <0 
*5j €§& 
Proof: Set ly = +o for every a5 There are two types of cycles: 
(a) circuits, i.e., cycles in which all arcs are oriented in the same 
direction. . | | ae oe 
(b) cycles which are not circuits. 


For every circuit €, we see that 


Yeas s 0. 


re = 


For cycles which are not circuits, we see that 


ye , => ” "which is vacuously true if all ky, 
£4 Pal tae 


's are finite. =. 


We are now in a position to show that the bound we gbtained on the . 
computation rate of transitions in a timed event graph is actually 


attainable. This forms the subject of our next .Heorem. 


~136- 


Theorem 4.3.4: Let 4 = (G,Q) be a timed. event graph. 4 has a strongly 


periodic firing schedule with period m > 0 iff 


™ 2 max 1 k 
vs )MO,a,,) 
ayy € Cc, e 


where (C,,C,,-..C,} is the set of all circuits in G. 


Proof: From Corollary 4.3.1.1 we have the result that a timed event 
graph 4 has a strongly periodic firing schedule with displacements 
KyseeeX, and period n iff 


Va,, €G, we have x, - x, 2c 


where c - 7M(0,a [4.2] 


4377 13)" oo es 
Now with each arc a,y> we associate 845 = 357 Bye | 


By Theorem 4.3.2, a function that assigns such-a .6,,.to each arc a 


4j ij 
must be a potential difference assignment. The set of inequalities [4.2] 
now becomes 


Va; 


4.8 
a * “ay ie be 4 see ole 
By Corollary 4.3.3.1, this system of inequalities pas a solution iff, 
for every circuit & € G, 
ye45 <0 
€§ [4.9] 


[4.9] now becomes 
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= yrs - on JMO,a, p <° VEEG, ‘ 
Vy € Eg #55 Meet = 
or T2 yi 
MgSh us MES 
YH0,2, .) 
a5 €5 


This is true iff 


" = Vv. € g i Pe 4 “8 . ply 
e€G i ‘ gs oo [4.10} 
dmee,e, ,) 
e 


Comments_on Theorem 4.3.4: We note that there exists a sttongly periodic 
firing ‘schedute with’ period tee Ba® See S. ee se eet 8 


7, = max yh 


Vv 


E€G £5 


Hea, 5) 
ee €& 
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The computation rate p of transitions in G is given by p -+ 7 


Hence, - ne : 
ps p' = ain Poy) : ‘ : 
eeGy A € [4.11] 
an 
vs 4 


It is clear from fig caieeuatity .2z that there exists a strongly 
periodic firing schedule which realizes ‘computation rate 
p =p! | | 
Thus, the timed event graph can be allowed to “run” at any “speed” less 
than or equal to ‘of. In Section 4.2, we argued that the computation 
rate of transitions in a timed event graph could not exceed that of 


transitions in the circuit with the minimum ratio of’ 


but our arguments were non-rigorous. In this section we have sub- 


PE et : scant 


stantiated our claim in a eigerous fashion. The circett with ‘the | 
‘minimum ratio of token content to sum of transition firing times is 
‘oraad the critica ) 1 cireu cuit: ‘and as vet have ener, this is the one 
which determines the alte computation rate of all transitions in | 


the net. 
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Section 4.4 Timed LSP Nets 

We now proceed to apply the result obtained in the previous Section . 
to determining the maximum computation rate of transitiong in a timed 
LSP Petri net. Our starting point will be to decide if a given marked 
Petri net is indeéd LSP. This can be done by applying the results deve- 
loped in Section 2.3. | 

Consider a timed LSP Petri net £ = (9,0), The timed Petri net 
in Figure 4.2.1 is an LSP net, and we saw that we were able to constr- 
uct a feasible periodic schedule for it. We now show that every LSP 
Petri net has.a feasible periodic firing schedule, and we show how to 
find the period and the computation rate of every transition in the 
net. ; ; = whe, 

Recall from Section 3.4 that the steady state behavior. of an LSP 
net P can be represented by the steady-state equivalent net 4. 
Furthermore, recall that J has the structure of a multiply-labelled - 
event graph. The only difference between event-graphs and alee: 
labelled event-graphs is that that the latter may have multiple dna. 
tances’ of certain transitions. We will begin by assuming that all 
transitions ina pai ie irae ha event dade have distinct labels, 
and the steady-state equivalent net Ie can be regarded a os event graph. 
Each transition is assigned the firing time oF the transition ig ortus: 
ponds to before the relabelling was done. The ‘computation 1 rate of trans 
itions in the meow ie event E graph can he pe fod by eopiying ‘the : 
result of [4.11]. . . 

We showed in Chapter 3 that the steady state equivalent net of j 


consistent LSP Petri net must have a number of occurrences of each 
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transition equal to its current in a consistent current assignment. We 
now show that the computation rate of transitions in a timed LSP Petri 
net must be proportional to their currents in a consistent current 

assignment. To show this, let us consider the steady-state equivalent 


net J . We have seen that we can find the _computaly.bn rate of tran- 


Poel ? 


sitions in the relabelled net ey treat ing it as an event staph. We 


will call this the fundamental computation rate of he, 6? net, and will 


, “ot 


denote it by p. If, the ‘multipitertty of ‘any transition % is UCYy)> it 


means that in unit “tine; “sxaneition v, in the eiga LSP ‘pat: z. ‘fires 


i 
uv, ) X Pp times, because each firing of ‘we -ingt ance. of transition of 


ne 


transition v, in J is also a firing of trans ition Ys in FP. Hence, 


i 
we can find che computation. rate of any transition “4 es sively multi- 


ic 
ner 
a ee ad 


plying: 'p by the multiplicity ‘of ‘that exgneitioa in J.” 
Summarizing, the following steps are are involved in finding the 
computation rate of transitions in a timed LSP Petri net £ : 


Step 1 Find the steady state ocutnatent net 4 off tye ‘ondagigiag LSP 

eo es Ps ° 

Step 2 Treating 4 as an event graph, Stpedee maximum computation 
rate p by applying ics 11]. Thte/ shear ba maximum fundamental 


Petri net 2. 


computation rate of the timed LP Petri ‘at rae 
Step 3 If Py is the —= comput atch rate. ot frqsisition "" im Fa > 
and uv, ) is its milttplicity is J . = iP =p X ee 


ae Pe 


We illustrate this method by was example. <1 <. 


Exgaplé: -Gonsider the timed LSP amis me: 96: ELEURS, 4° 2.1 (reproduced 


in Figure 4.4.1). In order to make our scuabie as general as possible, 
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Figure 4.4.1 Timed LSP Petri net wt = (P my with | a live, bounded 
marking M. 


Figure 4.4.2 Steady state Equivatent net J for LSP Petri-net P with 
marking M. 
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the net has a marking M that is not safe, but which is live, bounded 
and persistent. In Figure 4.4.2, we show the steady state equivalent 
4 of the net Pwith marking M. 4S can be decomposed into four circui- 
ts ; 
Py Pye 3P yf oPy Py 
Co? Pat yPot Ps | 
Cy? Paty PotyP,tsPy 
C,? 3b yPyt3Py PoP 
The fundamental computation rate ~ fa: sean to be 
p= wiin{ 2/12, 1/8, 2/10, 1/10] | _ 
or p= 1/10. | 
Now, u(t, ) = 1 
u(t.) = 
u(t.) = 2 | | 
Thus the computation rates of transitions ty st, and t, are as follows: 
p, = 1/10 | 7 
Py = 1/10 


P, = 2/10. 


A note on the Structure of LSP Petri nets 


Although it is true that the LSP Petri nets that arise in the course 


of modelling practical systems will all be SMD there do exist Petri nets 


which are LSP but not SMD. The net in Figure 4.4.3 is an LSP Petri net, 


but it is not SMD as the reader can easily verify. The method we have 
given for finding the computation rate for timed LSP Petri nets holds 


for all LBP Petri nets which have a live, safe marking. 
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Figure 4.4.3 An LSP Petri net that is not SMD 
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The assumption of deterministic firing times OE rere nets has given 
us a great deal of Ansight into the action sa practical asynchronous sys- 


tary 


tems. We have been able to find the Pompatetton rate of transitions in 
timed event graphs and LSP Petri nets, and in each “came ve have been able 
to identify system components that limit the fcndemented ‘computation rate 
of the net (i.e., the critical circuit). The critical circuit represents 


a bottleneck in the system being modelled, and readers familiar with 


PERT networks will see the correspondence between critical circuits in 
timed event graphs and critical paths in PERT networks. 

The assumption of deterministic firing times may not be sufficiently 
accurate for practical systems. In practice, an action in an asynchronous 
system like an addition will be a random variable , and in a large number 
of cases this can be approximated by either a rectangular or a Gaussian 
distribution (see Figures 4.5.1 and 4.5.2 respectively). A Gaussian 
distribution has the problem that it has a "tail" which extends for nega- 
tive values of time. Since we cannot visualize an action taking less than 
zero time to occur, we would prefer to use the rectangular distribution 
shown in Figure 4.5.1. 

It is possible to find out the mean computation rate of a timed event 
graph when all firing times are given by a distribution like the rectan- 
gular distribution. However, the process is very tedious, and we will 
work with the means of each of the firing times. By doing this, we get 
a timed event graph whose computation rate can be found by applying Equa- 
tion [4.11]. It can be shown by using the results of Clark[(C3] that the 
computation rate so obtained is an upper bound on the true computation 


rate of the timed event graph whose firing times are random variables. 
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The deviation of the true mean computation rate from this bound depends 

upon on the standard deviations of the random variables, but there does 

not appear to be any simple expression that relates these quantities. 
We now turn to Chapter 5, where we find a bound on the computation 


rate of timed SMD Petri nets. 


CHAPTER 5 


COMPUTATION RATE OF ASYNCHRONOUS SYSTEMS WITH DECISIONS 


5.1 Timed SMD Petri Nets 


In Chapter 3, we illustrated the power of SMD Petri nets in model- 
ling asynchronous systems with Wecisions. We alsp established two 
important results about SMD Petri nets: | : 

(a) It is decidable if any given region for an SMD Petri net is 


a 


live. : 

(b) For SMA Petri nets, any marking which cee at least one token 

| on every state machine is live. ; | 5 
Thus, usiag the results presented in Chapter 3, we.cen design asyn- 
chronous processing systems for possible jes emase aktion by the tech- 
niques developed by Dennis and Patil Int, | b2]. ; ‘We a show how we can 
estimate the computation rate of this moke general, class of systems. 
Recall that in Chapter 4 we developed a teehnigas- for. ‘finding the 
maximum comput at fon rate of a large clase_of deterministic systems. 
In this chapter, we look at sha teteialness systems and see if we can 
obtain a bound on their computation rate. We.do this by finding a 
bound on the computation rate of trasbitions ti a timed SMD Petri net. 

We have; defined timed Petri nets n Section ih. 1 (ese definition 4.1.1). 
Thus, a timed SMD Petri net is an SMD het in aaah every transition has 
a fixed, non-zero firing time. For detatis of the firing mechanism 
and notation for the instantaneous marking ‘of & timed SMD net, the 


reader is urged to re-read Section 4.1. 
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Stage 3 


350 HP cars 


queues 


-L7T- 


Decision made at place p,. e 
Figure 5.1.1. 
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Let us now consider a timed SMD Petri net and examine its be- 
havior. We do this by constructing a firing scherele for it, and 
interpreting the notion of couput ation rate of a transition ta con- 
nection with the firing schedula. 

As a simple example to mot ivate this chapter, couetier the SMD 
Petri net model of an assembly line with decisions of one type eine 
cussed in Section 1.2. Figure 5. 1. 1 is such a Petri: yet tmodel. Let 
transitions a, b and c represent aspexbly operations, and let’p be a 
place which represents a bay at which a Secieite is made. {about which 
engine should be attached to a chassis. The engines are ‘available in 
the input queues connected to transitfons is and c, and opmiecet as- 
sembled autos are output into Sue queues marked “output. queues." Each 


of the assembly operations represented ~ the transitions a, b andc 


have associated time durations TT a’ “Tt and T. eppectively. A partially 


Th? 
assembled auto that appears at place p can be rquted: one of two ways, 
by firing either transition d or Fransition a. We ete in Section 1.2 
that this routing can be done in several ed and the exact mechanism 
of making this routing deeieiee does not coycern us ‘bere. We have 
pointed out some of the ways in vhieh this decision fan be made for a 
practical assembly line. From a performance standpoint,” we must know 
the relative numbers of each type of siitomobile whitch are produced by 
the assembly line. Let us see why. ~ Suppose’ .. Large proportion of 
all cars produced by the assembly line are a HP + aba and suppose 
that the final assembly process for attaching a 200° HP engine to a 


chassis is extremely slow (i.e., transition f has a long firing time). 


This assembly stage will not be able to handle the load imposed by it, 
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Figure 5.1.2 Occurrence Graph for Petri Net in Figure 5.1.1. 
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and will slow down the operation of the whole line by choking off the 
absorption of subassemblies from the input queues. It is intuitively 
obvious, therefore, that this assembly line can be balanced by matching 
the speed of operation of an assembly station to its expected frequency 
of use. It can also be seen that there is a tradeoff between the num- 
ber of physical processing units at an assembly station and the speed 
of each processing unit - a number of slow processing units can be 

used instead of a few fast ones, and vice versa. Our question in 
connection with this assembly line is - how do we estimate the rate at 
which it produces assembled autos? 

Let us examine our model carefully and see what information is needed 
to specify the system in a way which is complete for our purpose. We 
have shown input queues connected to some of the transitions in the net 
in Figure 5.1.1. We pointed out in Section 1.2 that we will assume 
that there is always at least one item in each input queue, so that it 
is not necessary to indicate the queues, but we have shown the queues 
for completeness. Let us now see how to-incorporate the relative fre- 
quency of use of the two alternative assembly stations into out model. 

The action of the Petri met in Figure 5.1.1 can be represented by 
means of an occurrence graph (see Section 3.5.3). In the occurrence 
graph shown in Figure 5.1.2, the probability or the relative frequency 
of occurrence of each of the outcomes of the decision made at place Py 
will be reflected in the number of occurrences of the corresponding 
transition in a long frustrum of the occurrence graph. For example, if 
2 


the probabilities of occurrence of transitions d and e are : and 3 


respectively, then, in a long frustrum of the occurrence graph there 
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will on an average, be twice as many occurrences of transition e as of 
transition d. This means that in a consistent current assignment for 
the Petri net, the current assigned to transition e will be twice that 
assigned to transition d. Our problem is to find the maximum comput ation 
rates of transitions ts. and ts in the net. In general, we would like 
to find the waximum computation rate of any transition in a timed 
SMD net, for which a consistent current assignment is given. We wish 
to find a computation rate assignment such that the computation rate 
of each transition is proportional to its current in the given con- 
sistent current assignment . 

The problem can be tackled by simulation. In such a method the 
computation rate of any transition can be found by letting the net "run" 
for a long time, and then dividing the number of times that transition 
has fired by the total amount of time that has é¢lapsed since time zero. 
The method is extremely time consuming and we would like an analytical 
technique which enables us to find a good bound on the computation rate 
of transitions in the timed net. 

Consider a timed SMD net x = (P, 0), having a minimal integer con- 
sistent current assignment %. We know from Section 3.5 that several 
c-equivalent nets are, in general, possible for x with consistent current 
assignment §. In Figure 5.1.3(a) we reproduce the net shown in Figure 
3.5.4. For the given minimal consistent current assignment, several 
c-equivalent nets are possible, and we give two of them (see Figures 
5.1.3(b) and 5.1.3{c). Each timed c-equivalent net defines a periodic 
schedule. We compute their fundamental computation rates, and get 


the following: 
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The SMD net X. 


Minimal integer consistent current assignment: 


9, = 9, = 1 P= Pe = 2 


We = Py 7 95 ae? 
Firing time assignment: 


T, = 2,7, 73,7 


= 4,7 


a1 ee By tee oe Ty 


3 4 5 


Figure 5.1.3(a) 
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Figure 5.1.3(b) 
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Figure 5.1.3(c) 
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Applying the tesults of Section. 4.4, 
faites Caipstarice tes y= ste(t, tye bd, 


Fundasient al Comput atiow Rate 9° = uta [t= os 4, 1|- bya 


Thus, the maxitunt Fendamental Computation: Rates of the two c-equivalent 
nets are different. In oréer to find a Hound on the mecimes fundasent al 


computation rate of the timed net %,. we mm equivalent 


net witich lias the largest computaéton rate. 

The maxiaum fundastental computation rate of a c-equivatent net for 
the timed net X represents the maximus fendasental computation rate of 
transitions in the net x for the behavior specified by that c-equivalent 


net. Tlifis leads to the following definition: 


Definition 5.1.1 The maxim fundamental computation rate of transitions 


in a timed $MD net xy = (PQ) for a miniaal integer consistent current 
assignment # is given by the fundamental computation rate of the c- 


equivalent net which has the largest fundamental computation rate. 


Definition 5.1.2 The maximum computation rate of a transition t, be- 
longing to a timed SMD Petri net x = (7,Q) with a minimal integer 
consistent current assignment ¢ is given by 

P, "= 9; X0 


where P, is the current assigned to t, by Sand p is the maximum funda- 


i 
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Figure 5.1.4 
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mental computation rate of x. 


We see that to obtain an exact value of the maximum fundamental compu- 


tation rate of X, we must find alt th ereqyiv _nets for % and 

find the one with the largeat fundamental computation rate. This is 

a very tedious process, and we would bike to find a simple method 

which gives a bound on the fundgmental maximus computation rate of X. 
Consider the timed SMD Petri net % = (AQ) in which the underlying 

Petri net Phas a live marking M. Let S,, 8),..-§, be the state 

machine components of #. Let the tragsitions of wa atate machine 

component 5; be tev tro < tay? their firing times be Tap Tio - Ty 

and their currents be Pip Pyare Pa respectively. Also, let 

Pay? Pyare’ Pay be the places belonging to the state machine S,. Let n, 

denote the number of tokens on state machine 5,. 

Consider the occurrence graph associated with one of the tokens on 


the state machine S, (see Figure 5.1.4). 


Since there are Qi tokens on the state machine S;> the maximum natural 


ia thus n,¥., or n The 


il til ars 
quantity Y, is termed the maximum natyral fundamental computation rate 
of the state machine, and to obtain the maximum natural comput ation 


computation rate of transition t 


rate of transition Kip we multiply Ln by Psy° The term maximun 
natural computation rate of a transition t,, refers to the computation 
rate it would have if the state machine S; were isolated from the other 
components of the SMD Petri net ?. However, as we saw in the case of 


timed event graphs, the components of a timed net affect each other's 
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fundamental computation rate. We now show that if two state machines 

S. and S have some transition t in common, then the fundamental 
computation rate of both state machines is min lv, »¥3l. Let 9 be 

the current associated with transition t in a consistent current 
assignment. Then, it is clear that the computation rate of t cannot 
exceed ~ X min IY, YJ For suppose without loss of generality that 
Y< ¥5: Then, the computation rate of t cannot be greater than x Yo. 
If it could be greater than 9 x Yas it would violate the definition of 
the fundamental computation rate of S,- Thus, the computation rate of t 


cannot exceed x min (y,.¥ ]. Also, by consistency, the computation 


j 


rate of any transition tay € S; must equal 
9 2 
at X X min I¥o¥ 4] = 9, X min (¥,>¥,]. 


Similarly, the computation rate of any transition a in s; equals 


P; . le 
8 xX 9 X min IY 5¥4] = ® im X min [¥,.¥,]. 


We conclude that if two state machine components S; and s; in a timed 
SMD Petri net have a transition in common, each state machine acquires 


a fundamental computation rate = min Ye. 


Let p = min [¥,.% vale Without loss of generality, let p = Yue 


g.2. 80 
We now construct sets Aye Rovere ed, of state machines as follows: 


If a is a set of state machines, then define A» 1 to be the set 


pt 


Ne U {sy S,27°°* Spy! where a $2909 Spr are state machines which 


have at least one transition in common with the state machines in Se 


Also, r. = {S,, $520++S.)> i.e., all the state machine components of ?. 
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Construct A from A until =). Then A\_ must include all 
ptl P ptl P P 


the state machines because P is strongly-comtiected. 


The reader can now see that if the maximum fundamental comput at ion 
rate of S, = ¥,, then the maximum fundamental computation rate of all 


state machines in A, is also tp and so on until we have exhausted all 


2 
state machines in the net 7. Thus, all state machines in , must have a 
maximum fundamental computation rate p = 1 min [¥,s¥o---¥ a] where 
S,> $5248, are all the state machine components of -. 


The maxisum computation rate of any transition t ; in the net is thus 
9; X Ps 
We have thus established the following result: 
In a timed SMD Petri net x=(P ,{i), the maximum computation rate p, of 
any transition t, is given by | 
P, =, x p' 
where p' = min IY s¥o0+-e¥ I 


where Y¥ Lan are the maximum fundamental computation rates of 


» Siac 
Sj; Soo---S.. The maximum fundamental computation rate Te of state 


machine S.. is given by 


es : 


a 
3 Ye kj 


j=l 


where n= number of tokens on state machine S.- teaetygeee tye, are 
the transitions of state machine S,° 5? Tj are, respectively, the 


current in a consistent current assignment for P and firing time of 
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transition th € Sy: 

The bound we have obtained here is based on the assumption that 
when a token appears at a place with conflict, then its further 
routing (i.e., which output transition to fire) is done statistically, 
using an a-priori probability measure proportional to the currents 
associated with the respective transitions. In the next section, 


we derive this bound by considering the c-equivalent nets for X. 
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In the previcus section, we saw } we can find the upper hound on 


the maximum fundamental computation @ of a timed @@ Petri net x with 

a wninimal integer consistent current’ baignment. This involved 

drawing all the timed c-equivalent shes for % and finding the ana with 

the largest maximum fundamental comput at Lon rate. Aa we peinted out, 

this te a tedious process. The fol toding Theorem gives an upper bound 
the maximum fundamental computation: rate of a timed 0 Petri net 

in terme of its structure, marking, j§§ sinimal integer conaistent 

current aseignment and firing time asatgument.. 


Theorem 5.2.1 In a timed SMD Petri net % = (9,0) with a minimal integer 
consistent current aseignwent $, the maximum fundamental comput ation 
rate is given by p'= min (¥.20-0¥ Meet are the fundamental 
computation rates of the atate machine components of x. The fundamental 


computation rate ‘, of state machine 5, is given by 


¥ apes cee 
k fr 
y Mik 
j=l 
where a, = number of tokens on state machine &° 
tepecrety. are the transitions of state machine S° 


q, ? Ty j are the current and firing time respectively of transition ~ 


thy € S.- 
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Proof: Consider the c-equivalent net € of the SMD net P for the * 
consistent current assignment $. We showed in Section 3.5 that 

for every state machine component Ss. 
circuit c.€ @ with the following property: 


in P, there exists a corresponding 
i 


Every transition t; in C,. has a multiplicity u(t Pe) equal to the 
current 9; assigned to transition t, in P by the minimal consistent 


current assignment $. Now let CyreeesC, perre’, be the simple cir- 


> Ct 
cuits in 6, where Cyo---C, correspond to state machines of P and 
CuyereeC, are simple circuits in 6 that do not correspond to state 

i i 1 
machines of P. Let Yyoee-¥is Yoel? °°? 2 be their respective fundamenta 


computation rates. For any circuit C, € {c,,..-¢} 


n 
i se 
bs 
y, M5715 
Vityy € Cc; 


Now, the fundamental computation rate p of the timed net y is given by 


‘= 
p min [¥,5---¥] 
= min [Y oeee¥ ae Yup te! 
or op’ S min (¥,.---¥] 
ry 
Note: 
There may exist some Ve € iY fanaest? such that 
< 
¥, < min {¥,...-¥ 1. 


Thus, while ,' = min [¥,.---¥ i] is certainly a bound, this bound may not 


be achievable. The computation rate Py of any transition t, is defined 
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OP, = PX ®. 
Once again, this value of Py ia a bound, but this bound may not be achie- 
vable. Theorem 5.2.1 enablea us to find a bound on the computation rate 
of any transition in a timed SM Petri net by finding for each state 
machine compon .nt 8. the corresponding fundamental comput at ion rate ¥ i 
This is simpler than finding the waximum* fundamental computation rate of 
all the c-equivalent nets for the 9@ Petri net. 
Examp le: Let us apply this result to the timed @@ Petri net of Figure 
5.1.3, Theorem 5.2.1 gives a bound on the maximums fundamental comput at ion 
rate equal te | 

min [ 2/15, 1/22] = 1/22. . 

By drawing all the c-equivalent nets for the G@ net, the regger can 
verify that the net of Figure 5.1.3¢e) has the largest maximum fundgmen- 
tal computation rate of 1/24. The bound of Theorem 5.2.1 capgot actua- 
lly be realized in any consiatent behavior af the SH net, but we see 
that it is certainly a reasonable bound. The reasca that the bound is 
not achievable is that there is no state machine component in # that 
correspenda to the multiply-labelled cirauit Pat ahah sPatabsts:--7Ps- 
Thig circuit has a weximum naturel fundsmental computation rate of 1/24. 
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5.3  Achievability of Bound on Computation Rate for timed SMD Petri nets 


The bound on the computation rate of transitions in a timed SMD Petri 
net that we have presented in the previous two Sections has been shown 
to be an upper bound on the computation rate for a given SMD Petri net 
with a minimal integer consistent current assignment. We have also shown 
that this bound may not be achievable, and in this Section we look at the 
issue of achievability. 

When the reader examines the expression for the bound, he will notice 
that it is always achievable for event graphs. Since the multiplicity 
of every transition in an event graph is one, this means that there exists 
a minimal integer consistent current assignment in which the current assi- 
gned to each transition is unity. Also, since a simple circuit in an 
event graph corresponds to a state machine componert if we view the event 
graph as an SMD Petri net, we can rewrite Theorem 5.2.1 for event graphs 
to read as follows: 
In a timed event graph 4 = (G,Q), the computation rate of all tran- 
sitions in the net is the same and is given by 

Pp ss min [¥,5---¥ I, 


where Cyre- eG, are all simple circuits of the event graph and 


M(O, ) 
fe C, : “i 
ies a eh 


T 


kl 
Yu, FG 


Ee 
it 


This is the same bound as the one we obtained in Expression [4.11] 


in Section 4.3. We also showed in Section 4.3 that this bound is achie- 


Min ree is setae 4 acer aetnan =: a hepiacmt—aairaranentteA Crime titers Se cer te ere nana ann 
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vable. Thus, the result in Theorem 5.2.1 represents an achievable bound 
if the SMD Petri nets we consider are restricted to being event graphs. 
Our question is « how good a bound is it for more general SMD Petri nete? 
The largest class of SMD Petri nets we have been able to preve achieva- 


bility for is the class we term a+SMD Petri nets. 


Definition 5.3.1 An (G-minimal integer consistent current assignment is 


one in which all transitions with: more than one input place are assigned 
unit current. 
Definition 5.3.2 An SMA Petri net is a@-SMA iff there exists an o-mini- 


mal integer consistent current assignment for it. 


Lemma 5.3.1 Let € be the c-equivalent net of an Q-SMA Petri net P for 
the Q-minimal integer consistent current assignment $. ‘Then, every 
circuit Gq. in @ corresponds to a state machine component s,. in? and 


vice versa. 
Prot = 


Consider a state machine component §, in P. wet ¥ bea cyclic frus- 
trum for & . We know that the initial slice of # mist contain at least 
one place from every state machine component ( follows from Theorem 
3.5.1). Choose some place Poy E S. from the places in the initial 
slice of #7. When two consecutive instances of this place in a cyclic 
frustrum for the given current assignment are considered, every state 
machine containing Py; must unfold into a chain that begins and ends at 


Py ,° Im the c-equivalent net constructed from the cyclic frustrunt, each 


chain corresponds to a circuit. 
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=>, Consider the cyclic frustrum for the given c-equivalent net. 
Since Pis SMA, every closed simple directed path is an SSM. Also, 
since every transition which has multiple input places has unit 
current, there is exactly one instance of each such transition in 

a cyclic frustrum. Every allocation reduction on the c-equivalent 
net results in exactly one SSM (a multiply-Labelled circuit) for each 
SSM in P. 

Suppose some closed directed simple path does not correspond to any 
SSM. Then it must correspond to some closed structure in which there 
exists a transition t with multiple input places. The only way in 
which such a closed structure could map into a circuit is for there to 
be more than one instance of it in a cyclic frustum, which is the de- 


sired contradiction (see Figure 5.3.1). 


II 


Multiple Instances of t, 


Figure 5.3.1 


As a consequence of Lemma 5.3.1, we have the following theorem: 
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Theorem 5.3.1: The bound on the computation rate of transitions 
attained in Theorem 5.2.1 holds for every timed & - SMD Petri net 


x = (PQ) 


Proof: By Lemma 5.3.1, the c-equivalent net €@ for an q- SMD Petri net ? 
is a multiply labelled event graph in which every simple cireuit 
corresponds to a state machine component of P and vice verea. By 
attaching firing times to each transition in P, the desired result 


follows. 


Example: Figure 5.3.2(a) gives an example of a timed q@ - SMA net, 

shown with a consistent current assignment. Figure 5.3.2(b) gives 

a c-equivalent net which realizes the computation rate given by 

the expression of Theorem 5.2.1. ‘The reader will note that each state 
machine in the aw - SMD Petri net of Figure 5.3.2(a) maps into a multiply 
labelled circuit in Figure 5.3.2(b) and vice versa. 

In Figure 5.3.3(a) the timed SMD Petri net of Figure 5.3.2(a) is shown 
with a minimal integer consistent current assignment in which transition 
ty, has a current equal to 2. This net is no longer q - SMA, and the 
bound of Theorem 5.2.1 can no longer be achieved, as can be verified 
. by examining its c-equivalent net which has the largest maximum 


fundamental computation rate. 


Bound from Theorem 5.2.1 


ae oS eee 
Pp” = min 20° 25 25 
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3 
Current Assignment Firing Time Assignment 
9, = 2 T= 3 
Po = 2 To = 5 
= 1 _ 
P3 T3 4 
Py, = T, = 2 
? = 4 us = 7 
Pe = 4 us = 8 
P, = 1 os 


Figure 5.3.2(a) 


Figure 5.3.2(b) 
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Po 
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1 


2 


2 
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Figure 5.3.3(a). 


Firing Time Assignment 


ies 
7, = 4 
2 
ame 
Te 8 
T= 3 
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Figure 5.3.3.(b) 
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Bound for c-equivalent net of Figure 5.3.3(b) 


teamin [55S * oe Fo = de 
Pp 20 25 26 19 26° 


Thus, the bound from Theorem 5.2.1 is not achievable by any c-equivalent 


net for the given timed SMD Petri net. 
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Section 5.4 Review of Results Obtained 


In this Section we will put our work in perspective and will point 
out its relationship to PERT networks [Fl] and the program flowcharts 
of Martin and Estrin [Ml, M2, M3]. 

In Chapter 2 we have argued that practical systems can be viewed 
as an ensemble of interacting components and that each system component 
can be viewed as a state machine. Subsequently, we investigated the 
modelling of such systems using Petri nets. We introduced SMD Petri 
nets as being the class of Petri nets which can model practical systems. 
The type of systems we wish to model do not have any redundant functional 
operators, and are free of deadlock. This translates to the problem of 
choosing only those SMD Petri nets which have a live marking. We have 
pointed out that we still do not understand how the state machine 
components of an SMD Petri net should be interconnected to ensure that 
the net has a live marking. Currently, the largest subclass of SMD 
Petri nets we know of with this property is the class of SMA Petri nets. 

Since any live marking for an SMD Petri net P is bounded, a consistent 
current assignment can be made to the transitions in fp. By multiplying all 
currents in a consistent current by the least common multiple of their 
denominators and dividing them by the greatest common divisor of their 
numerators, we get the minimal integer consistent current assignment. 
The current associated with a transition in a minimal integer consistent 
current assignment is the multiplicity of the transition in a c-equivalent 
net of the SMD Petri net for the given current assignment. 

We have also looked at the entire class of LSP Petri Nets (i.e., 


Petri nets which have a Live, Safe,Pergistent marking). Even though 
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the LSP Petri nets of interest are likely to be SMD, there do exist 
LSP Petri nets that are not SMD. A steady state equivalent net exists 
for any LSP Petri net. Furthermore, the Steady state equivalent net of 
an LSP Petri net is unique, and corresponds to the c-equivalent net 
for an LB SMD Petri net with a minimal integer consistent current as- 
signment. 

Figures 5.4.1 and 5.4.2 are Venn diagrams which exhibit the relation- 
ship between the various subclasses of Petri nets that have been con~ 


sidered in this thesis. 


SMA Petri nets 


State 
Machines 


Figure 5.4.1 
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LaF 
Petri nets 


Petri pets 


Shaded areas represent LSP Petri nets. 


(a) 


Figure 5.4.2 


We have introduced timed Petri neta in order to model the finite 
speed of operation of practical systems and have shown how to find the 
computation. rate of transttions in LB SMD Petri nets and LSP Petri nets. 
For both kinds of timed Petri nets, the maximum computation rate of a 
transition is the fundamental computation rate of the timed c-equivalent 
net for the timed Petri net multiplied by the multiplicity of the 
transition in the c-equivalent net. 

In order to find the maximum fundamental computation rate of a timed 


c-equivalent net (or a steady-state equivalent net for an LSP Petri net), 
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TIMED LSP 
NET 


Maximum compu- 
rate of transition 
t= maximum computatio 
of timed c-equivalent net 
with largest computation 
rate. Use Theorem 5.2.1 to 
obtain a bound. 


Maximum compu- 
tation rate of tran- 
sition t, = maximum 
computation rate of 
timed steady-state equi- 


valent net. 


TIMED SMD PETRI 
NET 


TIMED STEADY-STATE 


ery steady-state equ- 
ivalent net is a 
‘multiply-labelled 
event graph. 


Every c-equivalent\ 
net is a multiply 
labelled event 

graph. 


TIMED C-EQUIVA- 
LENT NET 


Maximum Comput ation 
rate of transition t 

1 i 
= u(t,) xp 


TIMED MULTIPLY- 
LABELLED EVENT 
GRAPH 


To find Pr. 


TIMED EVENT 


GRAPH Find fp by applying 


Equation [4.11]. 


Figure 5.4.3 
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we assume that all transitions are dictinctly labelled, and we find the 
maximum computation rate of the resulting timed event graph. The maximum 
computation rate of a timed event graph can be found by applying 
Equation [4.11]. 

We illustrate the various terms used and the relationship between 
the computation rates of timed event graphs, multiply-labelled event 
graphs, LSP Petri nets and SMD Petri nets by means of Figure 5.4.3. 
The diagram is self-explanatory and shows that the bound of Equation 
[4.11] forms the cornerstone of our work, the fundamental computation 
rate of timed SMD and LSP Petri nets being obtained by finding the funda- 
mental computation rate of an equivalent multiply-labelled event graph. 

We are now in a position to point out how our work relates to the 
following models of parallel processing and parallel computation: 

(a) PERT networks 

(b) Martin and Estrin Flowcharts. 
Let us begin with PERT networks [Fl]. A PERT network consists of an 
acyclic directed graph with an input vertex and an output vertex. All 
arcs in the network lie on paths from the input vertex to the output 
vertex. Each arc denotes an activity in the project being modelled 
by the network, and each activity takes a certain amount of time to 
occur. We can model a PERT network as a timed acyclic event graph by 
replacing each arc with its two end vertices by the structure shown in 
Figure 5.4.4. The time associated with the activity ab is now associated 
with the timed transition tab introduced into the arc ab. Transitions a 
and b are assumed to have zero firing time. In Figure 5.4.5 we show 


this transformation carried out on an example PERT network. 


aaa 
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(a) A PERT Network. 


y a OS 
: ENG, 


(b) Equivalent Timed Acyclic Event Graph. 


Figure 5.4.5 
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ll 


Figure 5.4.6. 
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Suppose we draw an arc from f to a in Figure 5.4.5. The resulting 
graph is a strongly-connected event graph, whose critical circuit 
corresponds to the critical path of the PERT network. In Chapter 6 
we will see how strongly-connected timed event graphs can be applied 
to many practical situations where PERT networks are inadequate. 

We now turn to the program flowcharts of Martin and Estrin [Ml, M2, 
M3]. These are directed graphs consisting of arcs and nodes. Nodes 
represent operations in a computer program and arcs represent data 
paths between them. The presence of a data value on a data path is 
represented by a marker. Each node has logical conditions (and, 
exclusive-or) on data values on the input arcs that must be satisfied 
before the node can act. The action of a node consists of removing a 
data value from the specified logical combination of input arcs, per- 
forming a computation,and then depositing data values on the specified 
logical combination of output arcs. The action of a node is assumed to 
take some finite amount of time. In Figure 5.4.6 we show how to model 
these timed nodes by means timed acyclic Petri net structures. A Martin 
and Estrin flowchart models program constructs like decisions and 
iteration together with aspects of parallel programs, like the fork and 
join operations [D6]. A test is performed to check if the operation of 
the flowchart can proceed in a deadlock-free manner; if an arc is drawn 
from the output node to the input node, this test turns out to be similar 
to the one for deciding if an SMD Petri net is SMA. Our work can be 
viewed as modelling cyclic or recurrently acting systems, while the 
work of Martin and Estrin is concerned with finding the mean execution 


time of parallel programs. The two pieces of work taken should provide 


-180- 


«a good bag of tools for the analysis and design of asynchronous 


computer systems. 
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CHAPTER 6 


APPLICATIONS OF TIMED PETRI NETS TO THE MODELLING OF ASYNCHRONOUS 


CONCURRENT SYSTEMS 


Deterministic Systems 

We will now consider applications fer our work drawn from diverse dis- 
ciplines such as cemputer systems modelling and operations research. Through- 
out this discussion, we will keep in mind the distinction we have made bet- 
ween deterministic and non-determinietic systems. We begin by presenting 
several types of deterministic systems. The simplest system we wish to con- 
sider is a set of adder units whose action can be represented by the timed 


event graph shown in Figure 6.1.1. 


perform addition 


T3 Output result 


Operand 2 Pick if 
6perand pair 


Figure 6.1.1 


Each token on the circuit represents a physical hardware adder unit which 
can be in one of three states shown (i.e., "ready to pick up operand pair", 
“ready to perform addition" and “ready to output result"). We will make 


the assumption that there is always an operand pair available in the 
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input queues, so that a hardware adder unit never has to be idle for want 
of operands, i.e., the environment does not introduce delays into the 


operation of the system. Applying Equation [4.11], the maximum rate at: which 


results appear in the output queue ia given by n/ ¢ T +7) + T,) 
where nis the number of physical hardware adder units, 
Ty> T, and 7, are the times required te perform the actions 


modelled by transitions ty ty and t,. 


Let ue aseume some values for the above parameters. 


T = 500 nsec. 


1 
= 2 « 
"9 
~ = 500 nsec. 


The maximum throughput rate of the adder is then 4/3 ¥ 1.33 million 
additions per second. 
A more interesting example is the timed Petri Net model of a three-stage 


pipelined floating point adder shown in Figure 6.1.2. 


Figure 6.1.2 


bbe 
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This pipelined adder has only one hardware unit per stage, ao that there is 
overlapped operation of the stages without any parallelism in each stage. 
The maximum throughput rate of the system is seen to be 
-min [ L/t, , l/t,, Utd. 

Let us examine the above expression. Suppose 1/T We tha eualient oF eRe 
three quantities in parenthesis. Then stage A has the slowest hardware 
unit, and the natural computation rate of this stage determines the through- 
put rate of the system. This means that having hardware units in stages B 
and C that are faster than the hardware unit of stage A is wasteful, since 
their added speed does not result in any extra system throughput. We will 
say that a determénistic system is balanced if the natural computation rate 
of all system parts is equal. In the context of the pipeline adder in Figure 
6.1.2. this means that i = urs = Tee 

A more complex pipelined adder would be one in which there are multi- 
ple hardware units in each stage. Such a pipelined adder is shown in Figure 


6.1.3. 


my, tokens 


Stage A Stage B Stage C 


Figure 6.1.3 
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Stages A,B and C have n> and a hardware functional units, respectively. 
In this case, the throughput of the system is given by 

nin [ n/t n/t, r n/t]. 
For the system to be balanced, we see that n/t = a/t, = 2/t: 
Thie implies the following: 

If a stage consiste of slow hardware functional units, a proportionately 
larger number of units should be present in the stage to ensure that the 
system is balanced. Absence of system balance implies that there are parts of 
the system with excess capacity that cannot be used. Our results can thus 
be used to test for system balance in determiniatic gysteme, and they serve 
to formalize the intuitive netions of balance that a herdware designer, 
would, no doubt, use in the deaign of pipelined syatems. 

PERT Networks and Project Scheduling 

PERT charts are used in Project scheduling to determine the shortest time 
that it takes for an eneemble of comcurrent activities te complete, given 
the precedence relationships between the activities and the time duration of 
each activity. We examined the relationship between PERT charts and timed 
Event Graphs in Section 5.6. We now see that by using timed event graphs, 
we can model aspects of project sched&éling end assembly lines that are not 
within the power of PERT charts. The two main advantages of timed event 
graphs over PERT charts is their ability to: 
(a) model systems that act recurrently. 
(b) model physical resource units explicitly. 
Suppose we consider the project represented by the PERT chart in’ Figure 
6.1.4. The project consists of activities “a" through "g" with the prece- 
dence constraints expressed by the PERT chart. We begin by drawing the 


equivalent acyclic event graph for this PERT chart , using the method given 
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in Section 5.6. This acyclic event graph is shown in Figure 6.1.5. 


Zee 


Figure 6.1.4 A PERT Chart. 


Figure 6.1.5 Equivalent Event Graph. 


Now suppose we want the PERT chart in Figure 6.1.4 to represent a manufac- 


turing process, where t 1 represents the start of the process, and to the 


end. We assume that each activity in the system requires the use of a 


unit of resource, like a lathe, a milling machine, etc. We wish to express 


on emt ee ~ ee ais ASG, GesAe so Be 
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the fact that there are only a finite number of units of each resource type. 
Also, some resource unite may be very expensive and may have to be shared 
among several activities (e.g. a high precieion lathe). How are we to ex~ 
press such system constraints using a PERT chart? The answer is that there 
is no way of doing this without augmenting the structure of PERT charts. Let 
us see what added descriptive power can be had by using timed event graphs. 
Around each of the transitions te through a we draw a loop. Each loop is 
marked with a number of tokens equal to the number of physical processing 
units available for the corresponding activity. The resulting event graph 


is shown in Figure 6.1.6. 


Figure 6.1.6 Event graph which models limited resources available 
for each activity in PERT chart. 


In addition to the transition representing an actiwity, each loop consists 
of a second transition, which we will interpret as the act of allocating 
a resource unit to the activity. Since we wish to model a recurrently acting 


production facility, we complete the loop between t and t, and add a large 


number of tokens to place p as shown in Figure 6.1.7. One of the self-loops 
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Figure 6.1.7 


(i.e., loops around each of the tramitions t,...t, now becomes the criti- 
_ eal circuit. The system can be balanced by assuming that the natu- 
ral computation rate of each of these circuits is the same, i.e. 
BRT gg REACT TE TG g)s OR, AC a? 
~ The n's denote the number of tokens on the corresponding circuits. If all 


the n's are infinite, then the processing capacity of the production facility 


becomes infinite. This is the situation represented by Figure 6.1.5. Since 
any production facility in real life has only finite. resources available 
to it, these resources can be explicitly represented as we did in Figure 


6.1.6. 


gn ee a 


A word now about adding an. infinite number of tokens to the loop formed 


by joining ty to t, through place p. Each token represents the possibility 


rr on ret ne sea Ranh Gceck 55k, Bet wierd wats 
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of a set of subassemblies being input to the production facility for process- 
ing. Our assumption that we can add an infinite number of tokens to place p 
is equivalent to saying that there is an unbounded amount of buffer capacity 
between the activities t. ane oe In practice the amount of buffer space 
(in the form of storage bays) between work stations in a production process 
ig bounded, and this would place a limit on the number of tokens we can put 
on place p. In that case, we would have to consider the natural computation 
rate of all circuits in Figure 6.1.7 including circuits like 


t,t tttet tr tgtet tit etc. 


fabcoi’ idfgoi’ 
Let us give an example of how the buffer capacity of the system may affect 


the maximum throughput rate of the system. We do this by considering 


some actual values for system parameters. 


Let T Sees = = 0. 
a 


2 g2 
| =z 3 n * 2. 
Th * 2 » * 1. 
Tas 3 4 nm 7 2. 
Ta =O” 6 n° 3. 
T°? 5 2 2. 
Te 27 8 n, * 3. 
TH = 3 sa = 2. 


Case (i) We assume that M(p) (i.e., the number of tokens on place p) = 6. 
The throughput rate of the system = min[ 2/3 1/2, 2/4, 3/6, 2/5, 3/8, 
2/3, 6/9, 6/11, 6/17] = 6/17. 


We see that circuit t,t tt tit limits the throughput rate of the system. 


idfgoi 


Case (ii) Now let M(p) = 100 (i.e.,"very large"). 


. 
e 
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The throughput rate of the system becomes min [ 2/3, 1/2, 2/4, 3/6, 2/5, 
3/8, 2/3, 100/9, 100/11, 100/17] = 3/8. 
The loop around transition te now becomes the bottleneck in the system. 
The production facility we have considered so far was arranged in such a 
way that each activity has its own set of resource units available to it. We 


now consider the issue of resource sharing. 


Resource Sharing 


Figure 6.1.8 


Consider the production process shown ifi Figure 6.1.8. Once again, 
letting Maree eT, denote the number of tokens. on the loops around transi- 
tions to Save t. respectively, we assume the following values for the parame- 


ters of the system: 


T = Tv = .e.e = = ‘ 
a2 b2 Te? 2 
vA = 6 7 1. 
Th = 2 n, = 1. 
Tv = = 

=i 5 a, 1. 
q = = 

al 10 m4 2. 


4.5 n = 
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We make the assumption that the same type of resource is used by both 
activities e. and te In Figure 6.1.8 , each activity has its own resource 
unit. In this configuration, the system throughput is 

min [ 1/6, 1/2, 1/5, 2/10, 1/4.5 ] = 1/6. 
We see that activity ae is the bottleneck in the system. Since activities 
e. and t. use the same type of resource, it is possible to pool their 
resource units together . Now whenever resources are pooled together, some 
resource allocation strategy must be adopted to ensure fair resource allo- 
cation to the contending resource users. Since event graphs can model only 
deterministie systems, we will use a simple strategy in which each resour- 


ce unit is allocated alternately to the two activities. The resulting 


system is shown in Figure 6.1.9. 


Figure 6.1.9 


We assume that all parameters of the system are the same as before; this 
time the throughput of the system is 


min [ 2/8. 1/5, 2/10, 1/4.5 ] = 1/5. 
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Thus, the activity to is no longer the bottleneck in the system. It 
should be noted that other resource allocation strategies are possible, 
but they result in non-deterministic nets. 

So far, the system models we have considered have all been timed 
event graphs. We now give an example of a system which is modelled by 
an LSP Petri net. 

Figure 6.1.10 is a timed LSP Petri net model of the assembly line 
that alternately outputs two types of automobiles, which we discussed 
in Section 1.2. 

The maximum computation rates of transitions t, and te are of interest 
because they tell us the maximum rate at which automobiles are output 
by this assembly line. These computation rates can be found from the 
timed steady state equivalent net of Figure 6.1.10(b). 

We now suppose that there is parallelism within some of the assembly 
stages, but that automobiles are still manufactured alternately. This 
can be modelled by adding tokens to places in the net of Figure 6.1.10(a) 
other than Ps and Pg» and we get an LBP Petri net of the type shown in 
Figure 6.1.10(c). 

The reader will recall from Section 3.4 that in order to draw the 
steady state equivalent net of an LBP net that is also LSP, tokens are 
removed until a life, safe marking results and then drawing the steady 
state equivalent net for it. The marking is constructed by the technique 
given in Section 3.4. We get the steady state equivalent net of Figure 
6.1.10(d). Once again, the maximum computation rate of transitions k, and 
k, can be found. 


6 


The system models we have considered so far have all been determinis- 


Figure 6.1.10(a) 


200 HP 


Output queues 
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Figure 6.1.10(b) Steady State Equivalent Net 


Figure 6.1.10(c) 
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Figure 6.1.10(d). Steady State Equivalent Net of the LBP Petri 
Net of Figure 6.1.10(c) 
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tic Petri nets. We now turn our attention to the modelling of non- 
deterministic systems. . 

The simplest example of a non-determiniatic system that we wish to 
model is the pipelined processor with decisions af the type discussed 
in Section 1.2. A timed SM Fetri net wodel for this is given in 
Figure 6.1.11. 


Figure 6.1.11 


This net has five state machine components, as the reader can easily 
verify (indicated as Stage 1,...,Stage 5). Two types of instructions 
can be processed, Type A and Type B. Let usa suppose that the relative 
frequency of these two instruction types is in the ratio of 2:3 (this 


can be found by statistical analysis of program traces). This leads to 
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the following minimal integer consistent current assignment: 


aS) 
A 
1 
aS) 
un 
6 
Fh 
I 
6 
~s 
HT] 
w 


The computation rate Py of transition ty gives the maximum rate at 
which instructions are absorbed from the input queue. Similarly, the 
computation rates D6 and P, give the maximum rates at which results 
are placed in the output queues. To calculate these quantities, we 
first find p'. From Theorem 5.2.1, 
(aan . 

p min [ Vy> Vos Ve> Vas ve] 
where each of the terms represents the fundamental computation rate of 
the corresponding stage in the system. System balance requires that 


Let T. be the firing time of transition t, where i = a,...,;f. Then, we 


have 
1 
o' = min{f ———, etc....] 
Cala 
= min : : 
= Dn aaa RY 45 SIL, 1 gi > 3T 
a c d £ 
= T = T = T,= T = Tle 
Suppose we choose re b . d Z £ 1 usec., 
then p' = 1/5 and the system is not balanced. 


A balanced and more economical system results when Le = 2.5 uy sec., 


Te = 1.67 uw sec., the other parameters being the same. Under this new 


firing time assignment, 
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p* -4 = 2x 10° instructions/sec. 
therefore, : 

Py = 5X p* ; = 10° instruct ions /sec. 

Oe = 2X 2x 10° = 4X 10° instructions/sec. 

p, = 3X 2x 10° = 6X 10° instructions/sec. 


The reader can construct further examples based on processors in which 
there are multiple hardware units within each stage, and can work out 
a method for an-optimal design based on certain objective and cost 
functions. We shall not attempt to do this here. 

The SMD Petri nets we have exhibited so far have mainly been models 
of pipelined processors or assembly processes. Let us now look at 
models for interacting cyclic processes in computer systems. We begin 
with a model for two processes that interact with each other through 
mailboxes. Figure 6.1.12 gives a schematic or a flowchart model for 
such a pair of processes. One of the processes is deterministic (or 
decisionless) and the other one has two decisions in it. The processes 
communicate by passing messages to each other through mailboxes, and 
the reader can convince himself that they can operate concurrently 
without deadlocking. This can be verified formally when the Petri net 
model for this system shown in Figure 6.1.13 is examined. The net is 
seen to be SMA, and in Figure 6.1.14 we indicate its state machine com- 
ponents. Since the marking shown puts at least one (in this case 
exactly one) token on each state machine, the marking is clearly live. 
This, as the reader will recall from Chapters 2 and 3, means that the 
system of processes has no redundant operators, and furthermore, that 


the processes can operate without being deadlocked. 
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Figure 6.1.12 
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Figure 6.1.13. 
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op 9 


op 19 


op 10 


Figure 6.1.14(a) Figure 6.1.14(b) 
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Figure 6.1.14(c) 
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op 5 


_op 19 


op 14 


Figure 6.1.14(d) 


Having thus established that the Petri net in Figure 6.1.13 has a 
live marking, we can now suppose that the transition labelled "op 19" 
represents an operation which outputs a result every time the transition 
fires. Our problem now is to find out the maximum rate at which op 19 
outputs results, given the usual parameters like the firing times of the 
transitions and a minimal integer consistent current assignment. The 
places marked “op 8" and "op 16" represent decisions, each decision 
having two outcomes. The relative frequency of the outcomes of each 
decision can be found by statistical means. Let the probabilities of 
op 11 and op 9 be Pil and Py respectively, where Pil + Pg = 1. 

Similarly, let Pig and Pa7 be the probabilities of op 18 and op 17 


respectively, such that Pi7 + Pig = 1. We can now obtain a minimal 
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integer consistent current assignment for the net, where each current is 


denoted as usual by 9, and 


f= Par [6.1] 

Pq Po | 

a = Pig [6.2] 
a Pay 


As an example, let Py, 7 1/3 Py * 2/3 


= 3/4 Py * 1/4 


Then, the following is a consistent cyrrent assignment: 


9, = 1 %= 1 9, = 1 %, 71 9 = 1 


win 


= %; = J = ig = ge = 
- = i- 
711 ?12 3 


1 


% * Pio 


From this consistent current assignment, we can derive the following 


minimal integer consistent current assignment: 


9,712 9-12 = 12 4712 gr 
% *10 %) * 3 4 " § 


= 12 %.715 %,=3 ?,, = 12 
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The maximum computation rate P49 = p' x 919 = 12p', where p' can be 
determined by applying Theorem 5.2.1. 

The example we have worked out clearly shows the utility of 
Theorem 5.2.1. As we have pointed out in Section 5.2, an achievable 
bound can be found for 9,9 by drawing the timed c-equivalent nets for 
the Petri net in 6.1.13, and finding the c-equivalent net with the 
minimum value of fundamental computation rate. This method is not 
easy and Theorem 5.2.1 gives a far more tractable method, although the 
bound so computed may be overly optimistic, since the Petri net is not 
@ - SMA. 

The timed event graph of Figure 6.1.9 models a production facility 
in which a deterministic resource sharing strategy was used to share 
and T,,. We pointed out that in 


1 b1 


order to model non-deterministic resource allocation strategies, we 


resource units between activities ee 


need SMD Petri nets. We show such a system in Figure 6.1.15. 
A minimal integer consistent current assignment is one which assigns 
unit current to each transition. i.e., 
Rao pg Ra Sea Bee eg ga es 
By decomposing the net into its state machine components, we see that 


only one state machine component has changed, namely the one containing 


i 
the resource pool and the transitions to to thi and thee The maximum 
2 
fundamental computation rate of this state machine is 7+ 7.41 4h? 
al a2 bl b2 


which is the same as that of the simple circuit t in Figure 


a2 fal th 1 
6.1.9. Thus, the non-deterministic strategy does not change the compu- 
tation rate of the system. This can be seen to be true for deterministic 


production facilities, as they have a fixed minimal integer consistent 
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Figure 6.1.15 The Production Facility of 6.1.9 with a Non-Deterninistic 


Resource Allocation Strat é 


current assignment. The reader is invited to construct further examples 
for himself. 

We conclude this chapter with an example of an ensemble of sequential 
processes which contend for shared resource units in a resource pool. 
Figure 6.1.16(a) is an SMD Petri net model of a system of three sequen- 
tial processes contending for two pooled resource units. The minimal 
integer consistent current assignment models the fact. that the relative 


frequencies with which Processes 1,2 and 3 are allocated resource units 
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Process 1 Process 2 Process 3 


Figure 6.1.16(a) 


are in the ratio of 1:2:3 (this is a coincidence). The net is SMA, 
and the four state machine components designated SyreeesS, are shown 
in Figure 6.1.16(b). 

By applying Theorem 5.2.1, the reader can easily find the computation 
rate of any transitions of interest. 

It is hoped that the material presented in this chapter has given the 
reader a good overview of the applications of the work presented in the 
earlier chapters. The reader should reinforce his understanding of this 


material by constructing further examples of his own. 
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Figure 6.1.16(b) 
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CHAPTER 7 


This thesis has answered several questions but has opened up many 
others. Basically, we have laid to rest the issues we raised in Chapter 
1. We have shown how to model asynchronous concurrent systems in an eco- 
nomical fashion, and, by augmenting our model with timing information, we 
have shown how to find bounds on the computation rate of a large class of 
systems. 

Our approach in establishing the validity of the SMD Petri net model 
for asynchronous systems has been heavily dependent on example and 
intuition. This may seem rather unsatisfying to some of us, and we pose 
the following problem to our readers: can we come up with a set of axioms 
that specifies the structure and behavior of asynchronous systems, and then 
show that these axioms lead to SMD Petri nets? 

SMD Petri nets have turned out to be a very rich class of structures 
for representing asynchronous concurrent systems. However, we do not have 
any necessary and sufficient structural conditions for an SMD Petri net 
to have a live marking, and we pose this as another problem to our rea- 
ders. 

The other major question that remains to be examined is to assume 
firing times to be random variables in order to model real-world systems 
more accurately than is possible with the model of Chapter 4. Good bounds 
are needed for the mean computation rate of such timed nets. Another issue 
that seems to merit some attention is to assume that input queues to 
an asynchronous concurrent processing systems are fed by items whose 


arrival rates are random (e.g., Poisson). This will lead to a statis- 
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tical fluctuation in the processing rate of the system, and the effect 

of buffers in smoothing out these fluctuations can be examined. A general 
performance analysis theory can ther be worked out for asynchronous 
comecurrent systems. We think thei our thesis is a step in that direc- 


tion, 
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APPENDIX I 


In this appendix, we show that the rooted tree T(V) for any vector 
addition system V is finite, and the proof of Theorem 2.3.1 is given. 
The results and techniques in this appendix are taken from Karp and 


Miller [K2]. 


Finiteness of the Tree T(V) 


To prove that T(V) is finite for any V requires two lemmas. The 
term subsequence used here does not refer necessarily to successive 
elements of a sequence. Thus 1, 3, 4, 15, 79,... is a subsequence of 
1, 2,3,.. 2. 


Lemma I.1: Let s yerreSyeree be an infinite sequence of elements 


o?§ 
r Paar F 
from (N U {W})” for some positive integer r. Then there exists an 
infinite subsequence s, ,S, ,...,S, ,... such that s, Ss, <...S5 8S, «.. 
i,? i i i 
1 2 n 1 2 n 


Proof: In s »-». there exists an infinite subsequence that is 


o?8y9°°°S, 
non-decreasing in the first element. In this sequence in turn, there 


exists infinite subsequence that is non-decreasing in the second element, 


etc. 


Lemma I.2: (Konig Infinity Lemma [K3]. Let T be a rooted tree in which 
each vertex has only a finite number of successors and in which there is 


no infinite path directed away from the root node. Then T is finite. 


“po ae TE Niet fe tard ee Ef, ~ 9 2 a tl I ee 
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Theorem I.1: For any vector addition system V, the tree T(V) is 


finite. 


Proof: Assume otherwise and let By 2So0e0eS sees be a sequence of 


nodes in an infinite path directed away from the root node. By Lemma 
I.1 there must exist an infinite subsequence 8, 5 a aia »--. of this 
1 n 
sequence such that 4(s, )s £(s, }... S 4(s, )} < ... Since none of 
1 2 


n 


these nodes is an end, it can never happen that £(s, ) = Ls ). Tf 
n 


i 
ntl 
this were not true, then the path would be finite by condition 2(a) in 
the definition of T(V). 
From condition 2(b) in the definition of T(V), £(s, ) must have at 
n+l 


least one more element equal to “ than £(s, ) does. Since the number of 


i 
n 


elements is finite, we have a contradiction, and, therefore, no such 
infinite path can exist. From Lemma I.2 it must be the case that T(V) 


is finite. 
Proof of Theorem 2.3.1: For any vector addition system V and any integer 
vector x of the same dimension 


@y € R(V) such that x < y) @ ( 9 B € T(V) such that x < £(B) ). 


Proof: We first show that the right hand side implies the left hand 


side. The idea of the proof is that, if B is a node in T(V), then there 
are vectors in R(V) which agree with £(B) in its finite elements, and 

can be made arbitrarily large in the elements equal to \ by repetition of 
the sequence of vectors which led to the occurrence of ©. The details of 


the construction involve some calculation. 
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Suppose x < £(f8). Let the path from s to B have the successive 
nodes Lu ea where s = No and B = Te For j = 1,2,...k, let 
¥y be the vector associated with the arc directed into 1],; i.e., 
uF = M5 -Dy - Assume without loss of generality that the first h 
components of £(B) are equal to “, and that the other components are 
less than ¥. Assume further that, in the path from s to B, “'s are 
introduced in the order 1,2,...h. Then, for each ii, 1S ish 
there exists a consecutive subsequence t, = Yeci)? Yet) 41°" Vai) 
such that the vector us = Ve(i) + Vocajeles** + Vaca) is positive in 
the i-th element and nonnegative in the i+] st through nth elements. 
Note that t, is the subsequence that “accounts for" the i-th ©. 

Let - be a lower bound on all the (negative) elements of Uy seeesth« 


Also let {ny,-++sm} be any set of non-negative integers satisfying: 


> (x- 
ny (x 8), +a(h +L + ny + n, + .eet n,) 


ny 2 (x-8), + x(h + n, +n, + +--+, ) 


(1) 


<3 
N 


(x-s), +ru(h+2-it M417 ---+n)) 


Th 2 (x-s)) + 20 
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Such a set must exist because of the triangular form of the inequali- 
ties. 
s 
Choose $4 2Soo0ee Shay such that for 1 i<sh, 84 89-+-85 is the 
prefix of VyVo-0 eV, UP to the first occurrence of | in the i-th 


element, and $489°°*Sp ay = VyVo-°eVy- Then the sequence 
§ = 81t, Baty 22-8, t Shy TF Uy + Uy hae the following 
properties: 


+u 2x 


t g°°cUs 
(b) each partial sum s + u, +.-.u; is non-negative. 


(a} s+u 


We omit the detailed derivation of (a) and (b)} from. the system of 
inequalities (1). To show that the left hand side implies the right 


hand side, suppose that the following are truer s + a, + Wye + ee € R(Vv), 


xSs + Bw) + 2.0Ugs and s + uy, +...% a 20, m= I,2,...f, where the 
{u Jare elements of W. Apply the following operation to the sequence 


s, 8 + uy,» 8 + uw, + Ups++-8 + Uy +... + u, as many times as. possible: 


£ 

Find the first member of the sequence (call it u') such that, for 
some earlier member u'', u'' < u', 

(a) If u'' = u', then delete all members following u'; 

(b) otherwise, for each i such that Cut"), < (u"),> replace the i-th 
element of u* and of each vector beyond u’ in the sequence by . 

It should be clear that the sequence obtained at the conclusion of 
this process is the sequence of labels in some path directed from 
the root of T(V), and that the final label in this sequence is a vector 
greater than or equal to s + uy +... + u,. Hence, the left hand side 


f 
implies the right hand side, and the proof is complete. 
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To illustrate the construction of the sequence & given in the 
first part of the proof, the following example, due to Karp and 
Miller, is provided: 

Suppose s = (1,1,1,4) and W = {(0,0,0,-1), (2,-1,0,0), (-1,1,0,0), 


(-1,-3,4,0)}. Consider the following path in T(V): 


(1,1,1,4)C2%O-1, 44,153) 
2,-1,0,0 
(3,0,1,3)—eO29, 11,3) 
-1,1,0,0 


(0,01, 3) 22324, ww, 3) 


so ? (0,0,0,-1), (2,-1,0,0), (-1,1,0,0) e = (2,-1,0,0), (-1,1,0,0) 
85 = (-1,1,0,0) t, = (-1,1,0,90) 
8 = (-1,-3,4,0) t, = (-1,-3,4,0) 


Take x = (22,16,9,3) < (W,W,w,3) and let a = 3. The system of 
inequalities (1) for this case is: 
2 
ny > 214+ 34 +n, + a,) 
= 
2, 15 + 3(3 + n,) 
By =8+3.2 


A solution is: n, = 14, ny = 66, n, = 273, giving the sequence 


3 
§ = (0,0,0,-1), (2,-1,0,0), (-1,1,0,0), ((2,-1,0,0), (-1,1,0,0))2”3, 
(-1,1,0,0), (-1,1,0,0)°, (-1,-3,4,0), (-1,-3,4,0)!4, which 


establishes that the point (193,23,61,3) = x is in R(V). 
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